A Study on Generative Automatic Data Augmentation Based on Text-Driven Attribute Manipulation

Masatoshi Sekine

5:30 PM - 5:50 PM

[2L6-OS-19b-01] A Study on Generative Automatic Data Augmentation Based on Text-Driven Attribute Manipulation

〇Masatoshi Sekine¹, Daisuke Shimbara¹, Tomoyuki Myojin¹, Eri Imatani¹ (1. Hitachi, Ltd.)

Keywords:dataset quality, generative AI, attribute manipulation, automatic data augmentation, policy optimization

Unlike conventional software, AI software is developed inductively from training data. Therefore, preparing high-quality training data is crucial. Conventional automatic data augmentation methods primarily perform augmentation by directly manipulating the original data through means such as rotation and cropping, or by manipulating latent variables corresponding to the original data. These methods do not optimize data augmentation by manipulating various interpretable attribute information within the dataset. In this paper, we propose an automatic data augmentation method that generates new data by representing the attribute values of the original dataset in a text format. This method manipulates these attribute values to ensure data sufficiency and coverability by attribute value. Our proposed method optimizes data augmentation by learning how to manipulate textual attributes in ways that maximize the classification accuracies by attribute values and the naturalness of the textual data. This approach is expected to improve the overall quality of the dataset. We plan to implement and evaluate our proposed method to verify its effectiveness.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[2L6-OS-19b] OS-19

[2L6-OS-19b-01] A Study on Generative Automatic Data Augmentation Based on Text-Driven Attribute Manipulation

Password