JSAI2025

Presentation information

General Session

General Session » GS-1 Fundamental AI, theory

[2L5-GS-1] Fundamental AI, theory, algorithm:

Wed. May 28, 2025 3:40 PM - 5:20 PM Room L (Room 1007)

座長:中臺一博(東京科学大学)

4:40 PM - 5:00 PM

[2L5-GS-1-04] Development of Structural Learning Methods for Language Models Using Algebraic Statistical Approaches

〇Akihiro Maeda1,3, Takuma Torii2, Shohei Hidaka1 (1. Japan Advanced Institute of Science and Technology, 2. Tokyo Denki University, 3. JSPS)

Keywords:Language model, Algebraic statistics, Model selection

Algebraic statistics, a fusion of algebraic geometry and statistics, represents probabilistic models as low-dimensional structures in high-dimensional spaces and captures their structures through ideals of polynomial rings. From this perspective, it is expected to naturally model complex constraints such as conditional independence and hierarchical structures in linguistic data. This study formulates probabilistic models of language as joint probability distributions of words composing sentences and applies algebraic statistical approaches. By characterizing the models through vanishing ideals that constrain the probabilistic structures, the study develops methods to select models that fit the data. Experiments using synthetic data demonstrated that the proposed method achieved performance comparable to information criteria and directly extracted the structural characteristics of the probabilistic model.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password