JSAI2021

Presentation information

General Session

General Session » GS-5 Language media processing

[4J4-GS-6g] 言語メディア処理:応用

Fri. Jun 11, 2021 3:40 PM - 5:20 PM Room J (GS room 5)

座長:欅 惇志((株)デンソーアイティーラボラトリ)

4:00 PM - 4:20 PM

[4J4-GS-6g-02] An Approach to the Study of English and American Literature Using Text Mining and Word Vectors

Interpreting Herman Melville's Typee from the Perspective of Natural Language Processing

〇Jun Kaneko1, Takashi Otsuki1, Takayuki Sakaguchi1 (1. Yamagata University)

Current affiliation of the first author: Mie University

Keywords:Text Mining, Natural Language Processing, English & American Literature, Word2vec, spaCy

English and American literature has so far been studied using stylistics from Natural Language Processing, but here it tried to be interpreted from that perspective. The words that writers used in their works are sometimes slightly different in nuance from that used by people in general. The differences in the meanings of words were viewed not sensitively (qualitatively) but quantitatively understood. Herman Melville’s first novel, Typee, was analyzed here. By comparing word vectors of general linguistic data (pretrained model of fastText) with that from texts written by the writer (trained by fastText), the writer’s sense of language was revealed. Based on what has been already studied so far in terms of American Literature, some important words were selected. Therefore, it was confirmed that the differences in the meanings of words were quantitatively demonstrated by the words indicated by their degree of similarity.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password