JSAI2024

Presentation information

Poster Session

Poster session » Poster session

[3Xin2] Poster session 1

Thu. May 30, 2024 11:00 AM - 12:40 PM Room X (Event hall 1)

[3Xin2-44] Ordinal Regression by Combining Finely Labeled Data with Coarser-Labeled Data

〇Shinichiro Saito1, Takuji Tahara1 (1.Sansan, Inc.)

Keywords:Ordinal Regression, Hybrid Label, LightGBM

Ordinal regression is a classification problem with an order relationship between labels. In classification, including ordinal regression, finely labeled and coarser-labeled data are sometimes obtained because of different types of collection or time periods. Ideally, all obtained data can be used for training models, but it is impossible to train models directly using data with differently labeled granularity. We converted two different-granularity labels to soft labels based on a specific probability distribution. This enabled coarser-labeled data to be added to finely labeled data for training. We then performed experiments on two benchmark datasets to evaluate our proposed method. We found the method improves the performance of LightGBM in the ordinal regression problem by up to about 0.02 in accuracy and about 0.04 in mean absolute error. Additionally, a smaller probability distribution variance tends to improve accuracy, while a larger one tends to improve the mean absolute error.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password