JSAI2024

Presentation information

Poster Session

Poster session » Poster session

[4Xin2] Poster session 2

Fri. May 31, 2024 12:00 PM - 1:40 PM Room X (Event hall 1)

[4Xin2-33] Deep Learning Approach for Transferable Tabular Data Analysis Based on Fusion of Features from Column Names and Values

〇Shintaro Yamamoto1, Jumpei Ando1, Wataru Watanabe1, Toshiyuki Ono1 (1.Toshiba Corporation)

Keywords:tabular data, deep learning, neural network

Tabular data analysis is a crucial technique in various fields, including manufacturing and social infrastructure. In real-world scenarios, columns of tabular data may differ between samples due to factors such as variations in data collection sources or the inclusion of additional data contents. Most methods for tabular data analysis assume that the columns of all samples are identical. Consequently, a data analyst must choose between extracting columns that are available in all samples or selecting samples that contain the same columns. To address tabular data with different columns, a method called TransTab has been proposed. However, TransTab overlooks the relationship between column names and categorical values, making it challenging to address samples with the same categorical values but different column names. To mitigate above mentioned issue, we propose a novel approach that fuses features from column names and values. Our method has demonstrated a minimum improvement of 16.1 points in terms of AUROC compared to that of TransTab.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password