JSAI2019

Presentation information

General Session

General Session » [GS] J-13 AI application

[4P3-J-10] Vision, speech: image recognition learning

Fri. Jun 7, 2019 2:00 PM - 3:20 PM Room P (Front-left room of 1F Exhibition hall)

Chair:Takayoshi Yamashita Reviewer:Akisato Kimura

2:00 PM - 2:20 PM

[4P3-J-10-01] Application of Memory Reduction into Environmentally Invariant Regressor CNN

〇Mari Suzuki1, Makoto Masuda1 (1. Oki Electric Industry Co., Ltd.)

Keywords:image recognition, deep learning, low-rank approximation

In recent years, the need for computer vision applications such as object/scene classification has grown rapidly.
For example, tracking of vehicles and recognizing number plates can be used to automatically estimate traffic flow.
These applications require high environmental invariance (due to weather conditions, illumination changes, etc.) as well as compact design and fast rocessing speed in order to be deployed into edge devices.

In this paper, we consider the regression problem for vehicle's number plate recognition.
Recognition accuracy increases if letters/digits are geometrically transformed beforehand to fix distortions or mis-alignments.
However, using CNN to realize such regression task requires a large nunber of parameters.

We propose to use low-rank approximation and fine-tuning to reduce the size of such CNNs.
Experiments show that under various environments, our method is effective in greatly reducing memory consumption without significantly degrading performance.