JSAI2024

Presentation information

Organized Session

[2M1-OS-11a] OS-11

Wed. May 29, 2024 9:00 AM - 10:40 AM Room M (Room 53)

Organizers: Kenta Hanada (National Institute of Technology, Maizuru College), Daisuke Hatano (RIKEN), Takehide Soh (Kobe University)

10:00 AM - 10:20 AM

[2M1-OS-11a-03] Introducing Constraints to Multilabel Object Detection and application to ROAD-R

〇Sota Moriyama1,2, Koji Watanabe1,3, Katsumi Inoue1,2,3, Akihiro Takemura1 (1. National Institute of Technology, 2. Tokyo Institute of Technology, 3. The Graduate University for Advanced Studies)

Keywords: Object Recognition, Boolean Satisfiability Problem, Constraints, Autonomous Driving

Detecting the action of each object is instrumental to improving the usability of a detection model, but the risk of misrecognition grows as the number of label combinations increases. We therefore propose a framework that reduces misrecognition by exploiting the requirements that a set of labels must satisfy. Specifically, we propose MODYOLO, a novel multilabel object detection model built upon the state-of-the-art object detection model YOLOv8, and develop our framework on top of it. We then assess the framework's effectiveness by applying it to the ROAD-R Challenge for NeurIPS 2023. For Task 1, we introduce the Corrector Model and the Blender Model, two new models that run after the object detection step and aim to produce output that better satisfies the constraints. For Task 2, constrained losses are incorporated into the training process of MODYOLO using fuzzy logic. The results show that the framework was instrumental in improving the scores for both Tasks 1 and 2, allowing us to place third and first in the respective tasks.
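To make the idea of a fuzzy-logic constrained loss more concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not say which fuzzy logic is used, so the product t-norm here is an assumption, as are the clause encoding, the function names, and the two example labels ("red light" and "green light"). Each requirement is written as a disjunction of literals over label probabilities, and the loss measures how strongly the predicted probabilities violate it.

```python
from math import prod

# A clause is a list of (label_index, is_positive) literals joined by OR.
# Hypothetical labels for illustration only: 0 = "red light", 1 = "green light".
Clause = list[tuple[int, bool]]


def clause_violation(probs: list[float], clause: Clause) -> float:
    """Degree to which a disjunctive clause is violated (product t-norm, assumed).

    A clause is fully violated only when every literal is false, so the
    violation is the product of (1 - truth value) over its literals.
    """
    return prod((1.0 - probs[i]) if positive else probs[i] for i, positive in clause)


def constraint_loss(probs: list[float], clauses: list[Clause]) -> float:
    """Mean violation over all clauses; 0.0 means every clause is satisfied."""
    return sum(clause_violation(probs, c) for c in clauses) / len(clauses)


# "NOT red OR NOT green": a traffic light cannot be both red and green.
clauses = [[(0, False), (1, False)]]
print(constraint_loss([0.9, 0.8], clauses))  # large: both labels predicted together
print(constraint_loss([0.9, 0.0], clauses))  # zero: the requirement is satisfied
```

In a training setting, a term of this kind would typically be added to the detection loss with a weighting factor, so that predictions violating the requirements are penalized alongside ordinary classification errors. How MODYOLO combines the terms is not stated in the abstract.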
