JSAI2018

Presentation information

Oral presentation

General Session » [General Session] 2. Machine Learning

[1Z1] [General Session] 2. Machine Learning

Tue. Jun 5, 2018 1:20 PM - 3:00 PM Room Z (3F Matsu Take)

座長:大塚 琢馬(NTT)

2:40 PM - 3:00 PM

[1Z1-05] Bayesian Estimation and Model Averaging of CNN by Hypernet

〇Kenya Ukai1, Takashi Matsubara1, Kuniaki Uehara1 (1. Graduate School of System Infomatics, Kobe University)

Keywords:Hypernet, Bayesian estimation

Neural networks have rich ability to learn complex representations. However, due to the limited number of training samples, overfitting is likely to occur. Hence, it is essential to regularize the learning process of neural networks. In this paper, we propose a regularization method which estimates CNN's parameters as probabilistic distributions by using hypernet. Then, to make it applicable to a large model such as WideResNet, we used likelihood as loss function. Experimental results demonstrate the regularization of our method.