JSAI2020

Presentation information

General Session

General Session » J-9 Natural language processing, information retrieval

[1E3-GS-9] Natural language processing, information retrieval: Machine learning

Tue. Jun 9, 2020 1:20 PM - 3:00 PM Room E (jsai2020online-5)

座長:石畠正和(NTT)

1:20 PM - 1:40 PM

[1E3-GS-9-01] Optimization of Rank Order of Weighted City-block Distance

Toward Supervised Learning for Analogy Task using Word Vectors

〇Shohei Hidaka1 (1. Japan Advanced Institute of Science and Technology)

Keywords:Word embedding, Weighted distance, Optimization of rank order

Making analogical inference in vector space has become a standard method to test the quality of word vectors.
Typically the operator for the analogical inference is manually optimized for the task. In this study, we consider a systematic optimization of the metric-based rank order function for the analogical inference. If we directly evaluate the rank order function, one needs to process a few millions of word vectors every step of optimization. This causes a considerably large computational cost which makes a systematic optimization of such analogical inference intractable. In this study, we propose a theoretical approximation for this rank-order evaluation, and demonstrate an optimization of the analogical inference using the approximated evaluation. Lastly, we discuss about the ``parallelogram'' relationship, which may or may not have a deep connection with the well known ``analogy parallelogram'', revealed by the mathematical analysis of the probability of the distance-based rank order.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password