JSAI2024

Presentation information

Organized Session

Organized Session » OS-19

[2L6-OS-19b] OS-19

Wed. May 29, 2024 5:30 PM - 7:10 PM Room L (Room 52)

オーガナイザ:磯部 祥尚(産業技術総合研究所)、中島 震(放送大学・国立情報学研究所)、小林 健一(富士通株式会社)

5:30 PM - 5:50 PM

[2L6-OS-19b-01] A Study on Generative Automatic Data Augmentation Based on Text-Driven Attribute Manipulation

〇Masatoshi Sekine1, Daisuke Shimbara1, Tomoyuki Myojin1, Eri Imatani1 (1. Hitachi, Ltd.)

Keywords:dataset quality, generative AI, attribute manipulation, automatic data augmentation, policy optimization

Unlike conventional software, AI software is developed inductively from training data. Therefore, preparing high-quality training data is crucial. Conventional automatic data augmentation methods primarily perform augmentation by directly manipulating the original data through means such as rotation and cropping, or by manipulating latent variables corresponding to the original data. These methods do not optimize data augmentation by manipulating various interpretable attribute information within the dataset. In this paper, we propose an automatic data augmentation method that generates new data by representing the attribute values of the original dataset in a text format. This method manipulates these attribute values to ensure data sufficiency and coverability by attribute value. Our proposed method optimizes data augmentation by learning how to manipulate textual attributes in ways that maximize the classification accuracies by attribute values and the naturalness of the textual data. This approach is expected to improve the overall quality of the dataset. We plan to implement and evaluate our proposed method to verify its effectiveness.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password