Investigation of Expert Knowledge Extraction Using Pre-trained Language Models

Seiya Asano

2:20 PM - 2:40 PM

[1E3-GS-6-05] Investigation of Expert Knowledge Extraction Using Pre-trained Language Models

〇Seiya Asano¹, Masaru Isonuma^1,2, Kimitaka Asatani¹, Misuzu Nomura³, Junichiro Mori^1,4, Ichiro Sakata¹ (1. The University of Tokyo, 2. The University of Edinburgh, 3. Daikin Industries, Ltd., 4. RIKEN)

Keywords:Pre-trained language models, Knowledge extraction

In recent years, there has been a lot of research focused on using language models instead of knowledge bases. Language models have many advantages compared to structured knowledge bases, such as not requiring manual definition of information attributes and relationships and being able to search more data in a more flexible and efficient manner. However, their performance is still developing, and there are still many hurdles to overcome, such as the inability to predict compound nouns. This study specifically focused on the knowledge of specialized compound nouns related to chemistry and investigated how accurately knowledge in a specific field could be extracted. Specifically, by using SciFive, which was further trained with T5 on biomedical papers, and by performing additional training on abstract data contained in Scopus, the study aimed to improve the accuracy of extracting specialized knowledge in chemistry. The results confirmed how accuracy changes depending on the amount of data used for additional training, with a decrease in accuracy with less data and an improvement in accuracy with relatively more data. These results demonstrate further potential for attempts to extract knowledge from language models.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[1E3-GS-6] Language media processing

[1E3-GS-6-05] Investigation of Expert Knowledge Extraction Using Pre-trained Language Models

Password