10:40 AM - 11:00 AM
[4N1-GS-3-03] A Large Scale Web-Based Study of Japanese Vocabulary Size Estimation Test
Based on Word Familiarity Database, Reiwa edition
[[Online]]
Keywords: word familiarity, vocabulary size estimation, usage analysis
We investigated word familiarity and constructed the Word Familiarity Database, Reiwa edition, which contains about 163,000 words. By selecting test words according to their familiarity, we can estimate a person's approximate vocabulary size simply by asking whether or not they know a small number of words. We created a vocabulary-size estimation test based on this database and have made it available on the Web since June 4, 2020. Nearly two years have passed since its release, and the total number of users has exceeded 70,000. In this paper, we introduce a method for selecting test words and propose a new method for vocabulary-size estimation. In addition, we analyze the results of vocabulary-size estimation using Web logs. In particular, we show how vocabulary size changes with age and how the three released tests differ.
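The general idea of estimating vocabulary size from a few yes/no answers can be illustrated with a minimal sketch. This is not the estimation method proposed in the paper, which the abstract does not describe. It assumes a simple equal-width banding scheme: the word list is sorted from most to least familiar, one test word is taken from the middle of each band, and each known test word is counted as standing in for its whole band. The function names, the toy word list, and the answers are all hypothetical.

```python
def select_test_words(ranked_words, n_tests):
    """Pick one word from the middle of each of n_tests equal-width rank bands.

    ranked_words is assumed to be sorted from most to least familiar.
    """
    band = len(ranked_words) / n_tests
    return [ranked_words[int(band * (i + 0.5))] for i in range(n_tests)]


def estimate_vocabulary_size(total_words, answers):
    """Estimate vocabulary size from yes/no answers, one per test word.

    Assumption: each test word represents an equal share of the word list,
    so every "known" answer contributes one band's worth of words.
    """
    words_per_test = total_words / len(answers)
    return round(words_per_test * sum(answers))


# Hypothetical toy lexicon of 1,000 words, already sorted by familiarity.
ranked_words = [f"word{i:04d}" for i in range(1000)]
test_words = select_test_words(ranked_words, 10)

# Hypothetical respondent: knows the 6 most familiar test words only.
answers = [True] * 6 + [False] * 4
estimate = estimate_vocabulary_size(len(ranked_words), answers)
print(test_words[0], estimate)
```

With ten test words over a 1,000-word list, each answer accounts for 100 words, so this respondent's estimate is 600. A real test would need to handle inconsistent answers and a familiarity scale that is not evenly spaced, which this sketch ignores.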