Presentation information

Organized Session

Organized Session » OS-3

[3J3-OS-3a] AutoML(自動機械学習)(1/2)

Thu. Jun 16, 2022 1:30 PM - 3:10 PM Room J (Room J)

オーガナイザ:大西 正輝(産業技術総合研究所)[現地]、日野 英逸(統計数理研究所/理化学研究所)

2:30 PM - 2:50 PM

[3J3-OS-3a-04] An Impact of Weight Initialization on Model Evaluations in Neural Architecture Search

〇Nozomu Yoshinari1, Shinichi Shirakawa1 (1. Yokohama National University)


Keywords:Weight Initialization, Neural Architecture Search

Architecture is one key factor determining neural networks' performance, and neural architecture search, which aims at finding competent architectures without human effort, is one of the most intensive research areas of automated machine learning. While most papers in the area focused only on architecture, recent research show performance of architecture depends on other hyperparameters such as learning rate, and simultaneous optimization of them is needed to obtain a better model. This research focuses on weight initialization methods and investigates their impact on the performance of architectures after training. Through experiments on the architectures defined in NAS-Bench-201, we found an initialization method considering architecture significantly improved the performance of many models.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.