JSAI2022

Presentation information

General Session

General Session » GS-5 Language media processing

[4D3-GS-6] Language media processing: applications

Fri. Jun 17, 2022 2:00 PM - 3:40 PM Room D (Room D)

座長:伊藤 友貴(三井物産)[現地]

2:20 PM - 2:40 PM

[4D3-GS-6-02] Cluster Labeling for Patent Panoramic Analysis

〇Kana Ozaki1, Yasuhiro Sogawa1 (1. Research & Development Group, Hitachi, Ltd.)

Keywords:Cluster Labeling, Natural Language Processing, Patent

Patent data is important for companies to grasp the technology trends and the status of competitors. Patent panoramic analysis supports companies to position themselves in the market and grasp the trends of competitors, because it enables users to overview patents in a specified technical field by clustering and visualizing them. In each cluster, the users can capture the technical features of the patents by showing them cluster labels (representative keywords in the cluster). However, cluster labels often overlap among the clusters when we try to assign labels which consists of high frequency words in each cluster. This is because words that appear frequently in one cluster also appear frequently in other clusters when we cluster patents in a specified technical field. To tackle this challenge, we propose to use phrases as cluster labels and extract optimal combinations of the phrases which are discriminative among the clusters. In the experiment, we evaluate the effectiveness of our proposed method by using patent datasets.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password