JSAI2024

Presentation information

Poster Session

Poster session » Poster session

[4Xin2] Poster session 2

Fri. May 31, 2024 12:00 PM - 1:40 PM Room X (Event hall 1)

[4Xin2-17] A study of Automatic scoring model for essay-type questions using a small dataset expansion method with The Back Translation

〇Yuto Kasuga1, Shoichi Urano1 (1.Meiji University)

Keywords:deep learning, natural language processing, BERT

With the introduction of essay-type questions in The Common Test for University Admissions in Japan being considered, there is a growing demand for essay-type questions to test students' thinking ability. Although there have been many previous studies on the use of natural language processing to construct an automatic scoring model to reduce the cost of scoring, the feasibility of such a model in actual scoring situations is low because it requires a large data set to construct a highly accurate model. In this study, we expand the dataset by using The Back Translation method on the English dataset obtained by translating the dataset of answers written in Japanese into English. Then, we fine-tune the English pre-training model of BERT using the dataset obtained in this way, aiming to output a distributed representation that reflects the meaning of the answers. Finally, by building a classifier that outputs scores based on the distributed representations, we consider building a highly accurate automatic scoring model for essay-type questions from a small dataset.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password