JSAI2024

Presentation information

General Session

[2G5-GS-6] Language media processing

Wed. May 29, 2024 3:30 PM - 5:10 PM Room G (Room 22+23)

Chair: 牧田光晴 (LINEヤフー株式会社 / SB Intuitions株式会社)

4:10 PM - 4:30 PM

[2G5-GS-6-03] Generating Japanese Puns via Paraphrasing Using a Language Model Fine-Tuned with Furigana-Annotated Corpus

〇Tomohito Minami¹, Yuichi Sei¹, Yasuyuki Tahara¹, Akihiko Ohsuga¹ (1. The University of Electro-Communications)

Keywords: AI, Natural Language Generation, Large Language Model, Humor

Puns, a form of wordplay, are sentences built by combining words that sound alike but differ in meaning. Generating them requires a deep understanding of both word meanings and pronunciations. In this study, we develop a model that transforms Japanese sentences into puns without altering their original meaning, by fine-tuning pre-trained language models to focus on Japanese phonetics. Using furigana-annotated corpora together with a pun database, we improve the language model's grasp of Japanese phonetic nuances and its ability to generate puns. In comparative experiments, our method achieves a BLEU score 0.035 points higher than that of models fine-tuned only on a pun dataset, without any consideration of phonetic comprehension.
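
For readers unfamiliar with the metric, the following is a minimal sketch of a sentence-level BLEU computation: clipped n-gram precision combined with a brevity penalty. It is not the authors' evaluation code. The function names `ngrams` and `bleu`, the character-level tokenization, and the choice of n-grams up to length 4 without smoothing are all assumptions made for illustration; the abstract does not state the tokenizer, the n-gram order, or the scale on which the 0.035-point difference was measured.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU on a 0-1 scale (illustrative, unsmoothed).

    Assumption: strings are split into characters, which avoids needing
    a Japanese word segmenter. Real evaluations usually tokenize first.
    """
    cand = list(candidate)
    ref = list(reference)
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        ref_counts = ngrams(ref, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or clipped == 0:
            # Without smoothing, one zero precision makes the score zero.
            return 0.0
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty: penalize candidates shorter than the reference.
    if len(cand) >= len(ref):
        bp = 1.0
    else:
        bp = math.exp(1 - len(ref) / len(cand))

    # Geometric mean of the n-gram precisions, scaled by the penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    # Hypothetical strings, not taken from the paper's dataset.
    reference = "布団が吹っ飛んだ"
    candidate = "布団が吹き飛んだ"
    print(f"BLEU = {bleu(candidate, reference):.3f}")
```

Because the score is bounded by 0 and 1 in this formulation, a difference of a few hundredths is best read alongside the baseline score it is compared against.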
