
Presentation information

General Session

[2B5-GS-6] Language media processing: learning / inference

Wed. Jun 15, 2022 3:20 PM - 5:00 PM Room B (Room C-1)

Chair: 竹岡 邦紘 (NEC) [on-site]

3:40 PM - 4:00 PM

[2B5-GS-6-02] Validity Judgment of Phrase Connectivity by Self-Supervised Learning for Change Point Detection

〇Ryota Morinaga1, Daiki Tamashiro1, Satoshi Ono1 (1. Kagoshima University)


Keywords: Natural Language Processing, Self-supervised Learning, Machine Learning

The rapid development of deep neural networks (DNNs) has driven various technological innovations in natural language processing (NLP). However, DNNs require large amounts of training data, and assigning labels is the bottleneck in preparing that data. For this reason, self-supervised learning (SSL), which generates labeled training data from unlabeled data, has been attracting attention. Separately, there has been extensive research on proofreading support for Japanese text, which has made it possible to detect superficial errors such as spelling and homonym errors. This study proposes an SSL-based method that judges the validity of phrase connectivity in terms of grammatical or semantic integrity. The proposed method synthesizes labeled training data by cutting and connecting two randomly selected phrases and assigning ground-truth labels to the results. Experimental results demonstrated the effectiveness of the proposed method on this task.
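The data synthesis step described in the abstract might be sketched as follows. This is a minimal illustration and not the authors' implementation. It assumes that sentences are already segmented into phrases, that adjacent phrases in an original sentence count as valid connections (label 1), and that a phrase joined to a randomly chosen phrase from a different sentence counts as an invalid connection (label 0). The function name `synthesize_examples`, its parameters, and the English toy data are hypothetical; the study itself targets Japanese text, where phrase segmentation would require a separate parser.

```python
import random
from typing import List, Tuple

# A sentence is assumed to be pre-segmented into phrase strings.
Sentence = List[str]
Example = Tuple[str, str, int]  # (left phrase, right phrase, label)


def synthesize_examples(
    sentences: List[Sentence], n_negative: int, seed: int = 0
) -> List[Example]:
    """Build labeled phrase pairs from unlabeled sentences (hypothetical sketch)."""
    rng = random.Random(seed)
    pool = [s for s in sentences if s]  # ignore empty sentences

    # Assumed positive examples: phrases that are adjacent in the original text.
    examples: List[Example] = []
    for phrases in pool:
        for left, right in zip(phrases, phrases[1:]):
            examples.append((left, right, 1))
    positives = {(left, right) for left, right, _ in examples}

    # Assumed negative examples: connect phrases cut from two different sentences.
    if len(pool) < 2:
        return examples
    made, attempts = 0, 0
    max_attempts = 100 * max(n_negative, 1)  # avoid looping forever on tiny inputs
    while made < n_negative and attempts < max_attempts:
        attempts += 1
        i, j = rng.sample(range(len(pool)), 2)  # two distinct sentences
        left, right = rng.choice(pool[i]), rng.choice(pool[j])
        if (left, right) in positives:
            continue  # skip pairs that also occur as genuine connections
        examples.append((left, right, 0))
        made += 1
    return examples


if __name__ == "__main__":
    toy = [["the cat", "sat", "on the mat"], ["a dog", "barked", "loudly"]]
    for left, right, label in synthesize_examples(toy, n_negative=4):
        print(label, left, "|", right)
```

The resulting triples could then be fed to any binary classifier. How the original method chooses cut points and balances the two classes is not stated in the abstract, so those choices are left as parameters here.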

