JSAI2021

Presentation information

General Session

General Session » GS-9 Human interface

[3H2-GS-9b] ヒューマンインタフェース:ユーザ支援

Thu. Jun 10, 2021 11:00 AM - 12:40 PM Room H (GS room 3)

座長:田和辻 可昌(早稲田大学)

11:40 AM - 12:00 PM

[3H2-GS-9b-03] Research on Method Study of Lip Reading Technology Using Machine Learning

〇Shodai Yamaguchi1, Eri Sato-Shimokawara1, Akihiro Matsufuji1, Toru Yamaguchi1 (1. Tokyo Metropolitan University)

Keywords:Lip-Reading, Lip area

In order to solve the problem of voice recognition, which reduces the recognition rate in noisy places, we tried to realize a lip-reading technology that could recognize even without voice. Software dedicated to reading lips has been developed for English speakers, but it is considered difficult to apply it to Japanese, which has a small number of vowels. Therefore, in this study, we conducted an experiment targeting Japanese speakers. Data were collected by having 10 subjects speak 5 times for each of the 6 limited sentences. As a result of classifying the collected data by machine learning, the classification by k-Nearest Neighbor was the most accurate. However, at the same time, it became clear that the recognition accuracy differs depending on the subject. In the future, we would like to proceed with the development of recognition methods that take individual differences into consideration.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password