JSAI2024

Presentation information

Poster Session

Poster session » Poster session

[3Xin2] Poster session 1

Thu. May 30, 2024 11:00 AM - 12:40 PM Room X (Event hall 1)

[3Xin2-103] Dataset Development of Vision-Language Model for Patent Data

〇Kazuya Ando1, Tsukito Mizoguchi1, Haruki Ishikawa1, Akira Iyoda1, Seiya Kawano2, Koichiro Yoshino2, Hirofumi Nonaka1 (1.Aichi Instetute Technology, 2.Guardian Robot Project, RIKEN)

Keywords:Patent, Vision-Language Model, Image Recognition, Multi-Modal, Patent Figure

In this study, we developed a dataset for the development of image-language models of text-drawing pairs in patent documents. Specifically, we created a large image-language dataset by mapping patent drawings to explanatory text using standardized expressions in patents.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password