JSAI2023

Presentation information

General Session

General Session » GS-5 Language media processing

[1E5-GS-6] Language media processing

Tue. Jun 6, 2023 5:00 PM - 7:00 PM Room E (A2)

座長:高瀬 翔(LINE) [現地]

5:00 PM - 5:20 PM

[1E5-GS-6-01] Creating linguistic embedding space for odors

〇Toshiki Kawamoto1, Masaki Tashiro1, Takamichi Nakamoto1, Manabu Okumura1 (1. Tokyo Institute of Technology)

Keywords:NLP, Odor

To obtain a genuine meaning for a natural language sentence, it is necessary to understand the connection between words or phrases in a language and various kinds of real-world information. One of such real-world information might be odors. Previous studies investigated whether word embeddings from word2vec can acquire odor information. However, their model, trained with general corpora, does not have much odor information due to a small volume of corpora related to odors. In this paper, we propose TOLE, Thesaurus-enhanced Odor-adaptive Linguistic Embeddings. TOLE retains the odor information with domain adaptation and word-level contrastive learning on pre-trained language models. As a result, TOLE can improve the similarity between odor embeddings from odor descriptors and linguistic embeddings.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password