JSAI2018

Presentation information

Oral presentation

General Session » [General Session] 3. Data Mining

[1P3] [General Session] 3. Data Mining

Tue. Jun 5, 2018 5:20 PM - 7:00 PM Room P (4F Emerald Lobby)

座長:岡田 将吾(北陸先端科学技術大学院大学)

5:40 PM - 6:00 PM

[1P3-02] Finding Representatives via Nearest Neighbor Based Binarization and Frequent Pattern Mining

〇Yuka Yoneda1, Mahito Sugiyama2, Takashi Washio1 (1. The Institute of Scientific and Industrial Research , 2. National Institute of Informatics)

Keywords:frequent pattern mining, binarization, representative data points

We propose to find representative data points from continuous data via a two-step procedure: We first binarize data points based on the nearest neighbor search, followed by performing frequent pattern mining on the binarized data. Since frequent patterns correspond to combinations of data points shared by many other data points as their neighbors, they are expected to well summarize the entire dataset. We empirically show that representative data points detected by our method have competitive quality with random sampling in the classification scenario.