JSAI2025

Presentation information

General Session

General Session » GS-2 Machine learning

[3S1-GS-2] Machine learning:

Thu. May 29, 2025 9:00 AM - 10:40 AM Room S (Room 701-2)

座長:北岡 旦(NEC)

10:00 AM - 10:20 AM

[3S1-GS-2-04] Estimating Geometric Quantities of the Encorder's Latent Spaces

Analyzing CNNs and Transformers with Information Geometry

〇Ikumi Akatsuka1, Noboru Murata1 (1. Waseda University)

Keywords:machine learning, information geometry, latent space

Encoders such as CNNs and Transformers can embed high-dimensional objects (e.g., images) into low-dimensional vectors via an object embedding operation, and many previous studies treat the latent space formed by these embedding vectors as a Euclidean space. In this study, I aim to capture the geometric structure of the latent space that may be overlooked under a purely Euclidean assumption. To this end, we propose a method that associates the encoder’s intermediate representations with probability distributions, thereby defining an information-geometric manifold on which we can estimate geometric quantities such as metrics and curvature. The set of distributions obtained by inputting an image dataset into the encoder forms an information-geometric manifold with the α-divergence as its distance, and its expectation coordinates coincide with the embedding vectors. Through experiments estimating the metric and curvature of the MNIST dataset learned by a CNN, we found that the latent space exhibits positive curvature in many regions, indicating that it is not necessarily flat.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password