JSAI2022

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O4-GS-7] Vision, speech media processing: detection / data set creation

Tue. Jun 14, 2022 2:20 PM - 4:00 PM Room O (Room 510)

座長:石原 賢太(NEC)[遠隔]

3:20 PM - 3:40 PM

[1O4-GS-7-04] Constructing A Dataset of Demonstrative Anaphora in Comic and An Estimation of Effectiveness Using Machine Learning Methods

〇Hidenori Yamato1, Makoto Okada1, Naoki Mori1 (1. Osaka Prefecture University)

Keywords:Anaphora Estimation, Comic Computing, Machine Learning, Dataset Construction

Since comics are multimodal creations consisting of pictures and dialogues, they are attracting attention in the understanding of human creations by artificial intelligence. In comics, identifying this relation in pictures and dialogues is an important task for artificial intelligence to understand the contents of comics, because it helps to understand the contents. In this study, we focus on the anaphoric relation in demonstratives in reference resolution for the purpose of understanding comics with artificial intelligence. We constructed a dataset on the anaphoric relation between demonstratives and their antecedents in comic dialogues, and applied the constructed dataset to machine learning methods to verify the possibility of correspondence analysis of indicatives in comics and the effectiveness of the dataset.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password