4:00 PM - 4:20 PM
[1G4-OS-26a-04] The 2nd International Knowledge Graph Reasoning Challenge: Application of LLM to Predicting Behavior from Multimodal Data about Daily Life
[1G5-OS-26b] 日常生活知識とAI 17:40 〜 18:00 にて発表
Keywords:Knowledge Graph, MultiModalLLM, Video recognition
The unique feature of this challenge is that it provides data with a missing part of the knowledge graph, and it is necessary to compensate for the missing information by extracting information from the video and predicting using machine learning on the knowledge graph.
In this presentation, we provide an overview of the dataset and tasks of this inference challenge and introduce the four submissions. Since several of the submissions used multimodal LLMs, we will compare them and also discuss the challenges and expectations for current multimodal LLMs in this task.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.