JSAI2025

Presentation information

Organized Session

Organized Session » OS-46

[3P1-OS-46a] OS-46

Thu. May 29, 2025 9:00 AM - 10:40 AM Room P (Room 801-2)

オーガナイザ:古崎 晃司(大阪電気通信大学),森田 武史(青山学院大学),黒川 茂莉(KDDI総合研究所),広田 航(ストックマーク)

10:20 AM - 10:40 AM

[3P1-OS-46a-04] Assessing Logical Inference Capabilities of Large Language Models Through RDF Schema Entailment Rules

〇Taichi Hosokawa1, Takeshi Morita1 (1. Aoyama Gakuin University)

Keywords:RDF Schema entailment rules, counterfactual knowledge, large language models, ontology, logical inference capability

Large language models (LLMs) have recently demonstrated remarkable performance across various language tasks. However, they continue to face significant challenges in logical inference, often depending on pre-trained knowledge rather than engaging in genuine inference processes. Furthermore, their ability to perform inference tasks within ontology-based frameworks remains underexamined. This study focuses on RDF Schema entailment rules, leveraging two types of knowledge datasets: real-world datasets constructed from Linked Open Data and counterfactual datasets created by systematically modifying real-world knowledge from multiple perspectives. We propose a novel evaluation methodology to assess the inference capabilities of LLMs using these datasets. In the experiments, LLMs were provided with prompts containing rules and knowledge to generate inference outputs. These outputs were evaluated using real-world and counterfactual knowledge based on precision, recall, and F1 scores. The findings revealed inference failures in rare knowledge structures and a reliance on resource name patterns, underscoring the limitations of LLMs in inference with RDF Schema entailment rules.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password