JSAI2023

Presentation information

General Session

General Session » GS-3 Knowledge utilization and sharing

[3R1-GS-3] Knowledge utilization and sharing

Thu. Jun 8, 2023 9:00 AM - 10:40 AM Room R (602)

座長:森田 武史(青山学院大学) [現地]

10:00 AM - 10:20 AM

[3R1-GS-3-04] Generation of word embeddings for Japanese word sense disambiguate using paragraph embeddings in front and behind the target

〇Taiyo Maehara1, Yoichi Takenaka1 (1. Kansai University)

Keywords:word sense disambiguate, word embeddings, BERT

In recent years, using "word embeddings," in which vectors represent word meanings, has made it easier for computers to handle language meanings. However, Word Sense Disambiguation remains an issue for polysemous words.
Word Sense Disambiguation determines which sense a polysemous word is used in a sentence. It is an essential task for computers to handle the meaning of language. For Japanese Word Sense Disambiguation, we propose a method to generate word embeddings of words so that the variance between clusters of different word senses is larger and the variance within each cluster is smaller. Our proposed model uses data before and after the target paragraph. The data is paragraphs before and after the target paragraph. We generated word embeddings of five targets word with conventional and our proposed methods, We compare existing and our proposed method for verification. We evaluate the inter-cluster and intra-claster variance and conduct the overall evaluation.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password