[3Xin2-103] Dataset Development of Vision-Language Model for Patent Data
Keywords: Patent, Vision-Language Model, Image Recognition, Multi-Modal, Patent Figure
In this study, we built a dataset for training vision-language models on drawing-text pairs taken from patent documents. Specifically, we created a large-scale image-text dataset by using the standardized expressions found in patents to map each patent drawing to its explanatory text.
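As a rough illustration of how standardized expressions could link drawings to text, the sketch below pairs figure files with sentences of the form "FIG. N is ...". The abstract does not specify which expressions or which language the study used, so the regular expression, the function name `pair_figures_with_text`, and the file names are all hypothetical.

```python
import re

# Assumed pattern for English figure-description sentences,
# e.g. "FIG. 1 is a perspective view of the device."
# The study's actual expressions are not given in the abstract.
FIG_SENTENCE = re.compile(
    r"FIG\.\s*(?P<num>\d+[A-Za-z]?)\s+"
    r"(?:is|shows|illustrates|depicts)\s+"
    r"(?P<desc>.+?)\.(?=\s|$)"
)


def pair_figures_with_text(description, figure_files):
    """Return (image_path, caption) pairs for figures with a matching sentence.

    description:  text of the drawing-description section (hypothetical input)
    figure_files: dict mapping a figure number such as "1" to an image path
    """
    pairs = []
    for match in FIG_SENTENCE.finditer(description):
        num = match.group("num")
        # Skip sentences whose figure image is not available.
        if num in figure_files:
            pairs.append((figure_files[num], match.group("desc").strip()))
    return pairs


sample = (
    "FIG. 1 is a perspective view of the device. "
    "FIG. 2 shows a cross-sectional view taken along line A-A. "
    "FIG. 3 is a block diagram of the controller."
)
files = {"1": "fig1.png", "2": "fig2.png"}  # no image for FIG. 3
pairs = pair_figures_with_text(sample, files)
```

Here `pairs` holds one (image, caption) tuple for each of FIG. 1 and FIG. 2, while FIG. 3 is skipped because no image is listed for it. A real pipeline would likely need patterns tuned to the patent office and language in question.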