4:10 PM - 4:30 PM
[2K5-GS-2-03] Generating a Wide Variety of Categorical Data Using Diffusion Models
Keywords:Tabular Data Generation, Diffusion Models, Categorical Data, Deep Learning
Diffusion Models, which have been frequently researched in the field of Computer Vision as a method that outperforms GANs, are not confined to that field, but are spreading to other fields as well. TabDDPM using Diffusion Models have also been proposed for tabular data generation, and its authors claimed it can generate highly accurate data. However, TabDDPM tends to generate similar data as the number of such categories increases, resulting in learning filures because it handles categorical data as one-hot vectors. In order to overcome that problem, in this paper, we propose Table BD (Table Bit Diffusion) by incorporating Bit Diffusion preprocessing method. In our experiments, Table BD can generate data with a larger number of categories than TabDDPM.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.