JSAI2024

Presentation information

General Session

General Session » GS-5 Language media processing

[1G3-GS-6] Language media processing:

Tue. May 28, 2024 1:00 PM - 2:40 PM Room G (Room 22+23)

座長:赤間 怜奈(東北大学)

1:20 PM - 1:40 PM

[1G3-GS-6-02] An Analysis of Sound Symbolism by Voiced-Voiceless Contrast in Japanese Onomatopoeia Using Word Embeddings

〇Shunnosuke Motomura1, Yuki Kubo1, Yuji Nozaki1,2, Maki Sakamoto1,2 (1. Kansei AI Co.,Ltd, 2. The University of Electro-Communications)

Keywords:onomatopoeia, sound symbolism, word embedding, Word2Vec, fastText

It is known that Japanese onomatopoeia has an aspect that has a semantic contrast between voiced and voiceless consonants (e.g., giragira-kirakira) at the beginning of the word. In this paper, we analyzed the difference between voiced and voiceless consonants of onomatopoeia in a space of static word embeddings such as Word2Vec. In an experiment in which we classified onomatopoeia starting with voiced or voiceless consonants in the space of word embeddings, we got a classification accuracy of 0.84 at best, which showed the possibility that word embeddings have some information about the sound symbolism of voiced and voiceless consonants. In an experiment in which we compared the contrast of voiced-voiceless consonants and bipolar adjective pairs, we got a result that showed that particular adjective pairs (e.g., "beautiful-ugly", "bright-dark") have some correlations with the contrast of the consonants, and the correlations have some compatibility with results shown by previous research.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password