JSAI2023

Presentation information

General Session

General Session » GS-5 Language media processing

[3T1-GS-6] Language media processing

Thu. Jun 8, 2023 9:00 AM - 10:40 AM Room T (Online)

座長:梶原 智之(愛媛大学) [現地]

10:00 AM - 10:20 AM

[3T1-GS-6-04] Improving Method for Extracting Causality from Text with Multi-step Fine-tuning of GPT-3 using Wikidata

〇Taketo Ohira1, Shun Shiramatsu1 (1. Graduate School of Engineering, Nagoya Institute of Technology)

[[Online]]

Keywords:Information extract

Causal knowledge is necessary to develop a facilitator agent that can understand the points of discussion and the opinions of the participants. However, causal knowledge contained in Wikidata, a famous knowledge graph, is not sufficient. Therefore, in this study, we have attempted to extend the training data for GPT-3 re-training using Wikidata's causal knowledge as a method for cause extraction. As a result, we had confirmed that the accuracy was improved over the conventional method. In this paper, we hypothesized that multi-stage retraining, rather than mere data expansion, would improve accuracy, and verified this hypothesis through experiments. The results showed that multi-step re-training improved extraction accuracy compared to mere data expansion. Furthermore, we designed a "generality'' scale to determine whether the extracted causes are widely known to the general public, and confirmed the trend that the "generality'' of causal relationships that are widely known to the general public is higher.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password