JSAI2020

Presentation information

[2Q6-GS-10] Vision, speech: Image analysis and application

Wed. Jun 10, 2020 5:50 PM - 7:30 PM Room Q (jsai2020online-17)

Chair: 秋元康佑 (NEC)

6:50 PM - 7:10 PM

[2Q6-GS-10-04] The effect of image resolution on inter-object relationship recognition

〇Ikuto Kurosawa1, Tetsunori Kobayashi1, Yoshihiko Hayashi1 (1. Waseda University)

Keywords: inter-object relationship recognition, resolution of feature maps, scene graph

Inter-object relationship recognition is the task of predicting the positional or action relationship that may hold between a pair of objects depicted in an image.
This task requires visual features of each object; feature maps obtained from the convolutional and pooling layers of a CNN are generally used.
However, because of their low resolution (that is, the small height and width of the map), these feature maps are considered insufficient to capture detailed information such as the position of a person's hand or the posture of an object.
To address this issue, we propose adopting a Feature Pyramid Network (FPN) to increase the resolution of the feature map.
The experimental results showed that using the FPN was the most effective way of increasing the resolution, and that it contributed to improving the performance of inter-object relationship recognition.
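To make the resolution argument concrete, the following is a minimal sketch, not the authors' implementation. It assumes a 512 x 512 input and the strides 32, 16, 8 and 4 that are common in ResNet-based FPNs; the abstract states neither. The helper names are hypothetical, and the lateral 1x1 and smoothing 3x3 convolutions of a real FPN are omitted, so only the upsample-and-add step of the top-down pathway is shown.

```python
# Illustrative sketch only: input size, strides, and helper names are
# assumptions, not details taken from the abstract.

def feature_map_size(height, width, stride):
    """Spatial size (rows, cols) of a feature map at a given total stride."""
    # Ceiling division, as produced by padded strided convolution or pooling.
    return (-(-height // stride), -(-width // stride))


def upsample_nearest_2x(fmap):
    """Double the height and width of a 2-D map by repeating each cell."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def top_down_merge(coarse, fine):
    """Add the 2x-upsampled coarse map to the finer map, cell by cell."""
    up = upsample_nearest_2x(coarse)
    if len(up) != len(fine) or len(up[0]) != len(fine[0]):
        raise ValueError("fine map must be exactly twice the size of coarse map")
    return [[u + f for u, f in zip(up_row, fine_row)]
            for up_row, fine_row in zip(up, fine)]


# Feature-map sizes for an assumed 512 x 512 image at each assumed stride.
sizes = {s: feature_map_size(512, 512, s) for s in (32, 16, 8, 4)}

# Toy single-channel maps: a 2 x 2 coarse map merged into a 4 x 4 fine map.
coarse = [[1.0, 2.0], [3.0, 4.0]]
fine = [[0.5] * 4 for _ in range(4)]
merged = top_down_merge(coarse, fine)
```

Under these assumptions, the stride-32 map is 16 x 16 while the stride-4 map is 128 x 128, which illustrates why a finer pyramid level could retain details such as hand position that the coarsest map loses.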
