JSAI2023

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O3-GS-7] Vision, speech media processing

Tue. Jun 6, 2023 1:00 PM - 2:40 PM Room O (E1+E2)

座長:田崎 豪(名城大学) [オンライン]

1:40 PM - 2:00 PM

[1O3-GS-7-03] The data augmentation for UAV river patrol AI using generated image by generative models such as Stable Diffusion

〇Yuta Takahashi1, Junichiro Fujii1, Masazumi Amakata1 (1. Yachiyo Engineering Co., Ltd.)

Keywords:UAV, River Patrol, Image Generation, Diffusion models

The data in the civil engineering field is less data with much variety. Drone river patrols fly vast river areas and detect illegal dumping, including general garbage, using AI. The patrol drones are not constantly flying, they are rarely captured by aerial images, and it is even more difficult to detect temporary illegal occupation. In previous studies, it has been confirmed that adding images taken on the ground with different angles of view to the learning data improves learning, but the number of images is required for training even if the images are taken on the ground. In order to improve the learning of the detection model, this study verified whether the image for data augmentation can be generated and learned by image generation AI such as Stable Diffusion.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password