JSAI2023

Presentation information

General Session


[1O5-GS-7] Vision, speech media processing

Tue. Jun 6, 2023 5:00 PM - 7:00 PM Room O (E1+E2)

Chair: 真矢 滋 (Toshiba) [on-site]

5:40 PM - 6:00 PM

[1O5-GS-7-03] Study of a Specialized Model of Object Detection Considering Distance and Angle between Camera and Objects

〇Keisuke Maesako1, Liang Zhang1 (1. SoftBank Corp.)

Keywords: object detection, aerial image, YOLO

Aerial communication platforms, which provide communication services from aircraft, have recently been attracting attention. We are considering a new service that applies object detection to aerial images taken with an additional onboard camera. A problem with object detection in aerial images, however, is that an object's appearance changes with the positional relationship (distance and angle) between the camera and the object. To address this, we propose building specialized object detection models: using automobiles as an example target, we classify the captured images into patterns and train a model for each pattern. In our performance evaluation, the specialized models achieved more than 30% higher Average Precision than a general-purpose model. We therefore expect that choosing the specialized model appropriate to the positional relationship between the camera and the object will improve object detection accuracy in aerial images.
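
The idea of choosing a specialized model according to the camera-object relationship can be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the image patterns are defined, so the distance and angle bin edges, the function names (bin_index, pattern_for, select_model), and the model file names are all hypothetical.

```python
from typing import Dict, Sequence, Tuple

# Hypothetical bin edges: the abstract does not give the real pattern boundaries.
DISTANCE_EDGES_M: Tuple[float, ...] = (50.0, 150.0)  # near / mid / far
ANGLE_EDGES_DEG: Tuple[float, ...] = (30.0, 60.0)    # shallow / oblique / steep

Pattern = Tuple[int, int]


def bin_index(value: float, edges: Sequence[float]) -> int:
    """Return the index of the first bin whose upper edge is greater than value."""
    for i, edge in enumerate(edges):
        if value < edge:
            return i
    return len(edges)  # value is at or beyond the last edge


def pattern_for(distance_m: float, angle_deg: float) -> Pattern:
    """Map a camera-object distance and angle to a (distance bin, angle bin) pattern."""
    return (
        bin_index(distance_m, DISTANCE_EDGES_M),
        bin_index(angle_deg, ANGLE_EDGES_DEG),
    )


def select_model(
    models: Dict[Pattern, str],
    distance_m: float,
    angle_deg: float,
    fallback: str = "general.pt",  # hypothetical general-purpose model
) -> str:
    """Pick the specialized model for this pattern, or the general-purpose one."""
    return models.get(pattern_for(distance_m, angle_deg), fallback)


# Hypothetical registry: one trained weight file per pattern.
SPECIALIZED_MODELS: Dict[Pattern, str] = {
    (0, 0): "near_shallow.pt",
    (2, 2): "far_steep.pt",
}
```

For example, select_model(SPECIALIZED_MODELS, 200.0, 75.0) picks the hypothetical "far_steep.pt" weights, while a pattern with no specialized model falls back to the general-purpose one.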
