4:45 PM - 5:00 PM
[E-06] On the Relation between the NTT Kanji data dictionary and Wikipedia
Keywords: word2vec, semantics, vector space models
The NTT Kanji database (Aman and Kondo, 1999) is one of the most popular kanji datasets. However, a long time has passed since it was published. word2vec (Mikolov et al., 2013) was proposed on the basis of large-vocabulary datasets. Despite the popularity of both, no comparison between them has been attempted so far. We tried to identify the differences between them from several perspectives.