JSAI2022

Presentation information

General Session

General Session » GS-5 Language media processing

[4D3-GS-6] Language media processing: applications

Fri. Jun 17, 2022 2:00 PM - 3:40 PM Room D (Room D)

座長:伊藤 友貴(三井物産)[現地]

2:00 PM - 2:20 PM

[4D3-GS-6-01] A method of selecting a sentence with puns reflecting what is in the picture

〇Reki Asano1, Motoki Yatsu1, Takeshi Morita1 (1. Aoyama Gakuin University)

Keywords:Image captioning, Humor

When a social robot makes an utterance based on the surrounding situation obtained from image input, it is considered that humor such as puns can improve its entertainment aspect. Therefore, we propose a ranking method for selecting puns. In the proposed method, plain captions are generated from the Japanese caption generation model learned from the STAIR Captions dataset, and important words and other morphemes are extracted from the obtained captions. The words obtained in this way are weighted so that the words resulting from object detection and important word extraction have larger values. As the output, the method selects a pun sentence in pun database that maximizes sum of the weights. In the subjective evaluation experiment, the proposed method selected puns for each of 10 images randomly selected from the MS COCO dataset. We asked 10 subjects whether the puns seem appropriate for the images and to make 5-point evaluations. As a result, the average evaluation value was 3.11 for the question, which was slightly higher than "neither".

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password