JSAI2020

Presentation information

General Session

General Session » J-2 Machine learning

[4J2-GS-2] Machine learning: Clustering

Fri. Jun 12, 2020 12:00 PM - 1:20 PM Room J (jsai2020online-10)

座長:中村篤祥(北海道大学)

12:00 PM - 12:20 PM

[4J2-GS-2-01] Learning Unlabeled-Unlabeled Classifier Using Nearest Neighbor Distance Ensemble

〇Mizuki Matsumoto1, Takashi Washio1 (1. Osaka University)

Keywords:Unsupervised Learning, Semi-supervised Learning, Classification from Unlabeled Data Only, Ensemble Estimation, Nearest Neighbor Distance

Most of instances in big-data are unlabeled, and the number of the available labeled instances are so limited that semi-supervised learning approaches are not effectively applied in many cases. This fact is one of the main obstacles for effective use of the big-data. In this study, we propose a novel approach to highly efficiently learn an accurate binary classifier from two given unlabeled data sets only. The approach classifies a given instance based on ensemble difference between its nearest-neighbor distances in the two unlabeled data sets. It provides consistent classification results within a constant computation time based on its mathematical background nature. Numerical experiments show high accuracies of the approach close to their upper bounds provided by Bayes error rates.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password