JSAI2024

Presentation information

Organized Session

Organized Session » OS-17

[4P3-OS-17c] OS-17

Fri. May 31, 2024 2:00 PM - 3:20 PM Room P (Room 401)

オーガナイザ:名取 直毅(株式会社アイシン)、梶 大介(株式会社デンソー)、廣瀬 正明(株式会社デンソー)、河村 芳海(トヨタ自動車株式会社)、梶 洋隆(トヨタ自動車株式会社)、城殿 清澄(株式会社豊田中央研究所)

3:00 PM - 3:20 PM

[4P3-OS-17c-04] Data Augmentation for Retriever Using Generative Model

〇Ayaka Yomogida1, Kazutada Ban1, Tetsuya Iida1, Fumihiko Murase1, Akira Mitani1, Takanori Takeno1, Toru Hirano2 (1. DENSO CORPORATION, 2. DENSO INTERNATIONAL AMERICA, INC.)

Keywords:Data Augmentation, Retiever, Generative Model

The Retrieval-Augmented-Generation (RAG) method is gaining attention for enhancing language generation ability in a specific domain. Although retriever using dense embedding, which have reported high accuracy, are critical for the overall accuracy, the challenge has been the high burden of preparing supervised data for the target domain. In this study, we considered data augmentation using the GPT-3.5 and clarified that queries that presuppose a document and queries without clear answers in the document . The proposed method confirmed the effect of improving the accuracy of the retriever with less data by removing such data from the generation results.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password