JSAI2024

Presentation information

Organized Session

Organized Session » OS-2

[3K5-OS-2b] OS-2

Thu. May 30, 2024 3:30 PM - 4:50 PM Room K (Room 44)

オーガナイザ:鈴木 健二(ソニーグループ株式会社)、原 聡(大阪大学)、谷中 瞳(東京大学)、菅原 朔(国立情報学研究所)

4:30 PM - 4:50 PM

[3K5-OS-2b-04] Fine-Tuned Data Representation Models for Data Exploration in Data Markets

〇Kosuke Manabe1, Yukihisa Fujita2, Masahiro Kuwahara2, Teruaki Hayashi1 (1. The University of Tokyo, 2. Toyota Motor Corporation)

Keywords:Language Models, Contrastive Learning, Data Market

With the development of information technology for data collection, storage, and analysis, data collaboration and utilization in different fields are attracting attention. In this climate, data markets are emerging to exchange data across fields. However, data analysis experience and data format expertise are needed to explore and discover the data related to our interests in the data exchange platforms. In addition, current metadata is mainly created manually, and the consistency and interpretability of descriptions are highly dependent on the knowledge and ex- perience of the data providers. To address the above issues, we propose a method for learning a data representation model that takes data bodies as input and outputs embedded representations for data retrieval. As a result, we found that the proposed method can obtain a data representation that more accurately reflects the topic of the data than existing methods.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password