JSAI2025

Presentation information

Organized Session

[1H3-OS-8a] OS-8

Tue. May 27, 2025 1:40 PM - 3:20 PM Room H (Room 1003)

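Organizers: Kei Nakagawa (Nomura Asset Management), Masanori Hirano (Preferred Networks), Hiroki Sakaji (Hokkaido University), Hiroyuki Sakai (Seikei University), Takanobu Mizuta (SPARX Asset Management), Takahiro Hoshino (Keio University)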

2:40 PM - 3:00 PM

[1H3-OS-8a-04] Development of Company Similarities from Both the Textual and Numerical Types of Financial Data

〇Kenji Hiramatsu1, Tomoki Ito2 (1. IFIS JAPAN LTD., 2. MITSUI & CO., LTD.)

Keywords: Text Mining, Financial Documents, Embedding

Similarity among companies is important information that forms the basis of analysis in various financial practices, such as corporate valuation, investment and loan decisions, portfolio risk management, partner selection for business promotion, and in-house investor relations activities.

A useful tool for calculating the degree of similarity between companies is an embedding representation of each company, which can be obtained by applying BERT and similar models to textual information.

While this text-based embedding representation is effective, the economic and financial fields also offer a large amount of numerical data that is expected to be useful for measuring the degree of similarity between companies. Combining these numerical financial data with textual data is expected to enable a search for more useful “similar companies”.

Therefore, this study proposes a method for searching for similar companies that utilizes both “textual information” and “numerical information.” Our method utilizes not only textual information on stocks, but also numerical information such as sales by segment, stock price time-series data, and shareholder composition.
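
The abstract does not state how the two kinds of information are combined, so the following is only a minimal sketch of one possible approach, not the authors' method. It assumes each company already has a text embedding (for example from BERT) and a numerical feature vector, standardizes the numerical features, and takes a weighted average of the two cosine-similarity matrices. The function names, the `text_weight` parameter, and the toy data are all hypothetical.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length (rows of zeros are left unchanged)."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.where(norms == 0.0, 1.0, norms)


def standardize(x):
    """Z-score each column so features on different scales are comparable."""
    std = x.std(axis=0)
    return (x - x.mean(axis=0)) / np.where(std == 0.0, 1.0, std)


def combined_similarity(text_emb, numeric, text_weight=0.5):
    """Weighted average of text and numerical cosine similarities.

    Assumption: a simple linear blend; the source does not specify one.
    """
    t = l2_normalize(np.asarray(text_emb, dtype=float))
    n = l2_normalize(standardize(np.asarray(numeric, dtype=float)))
    w = float(text_weight)
    return w * (t @ t.T) + (1.0 - w) * (n @ n.T)


def most_similar(sim, idx, k=1):
    """Indices of the k companies most similar to company idx, excluding itself."""
    order = np.argsort(-sim[idx])
    return [int(j) for j in order if j != idx][:k]


# Hypothetical toy data: 3 companies, 2-dim text embeddings, 2 numerical features.
text_emb = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
numeric = np.array([[10.0, 1.0], [11.0, 1.2], [50.0, 9.0]])

sim = combined_similarity(text_emb, numeric, text_weight=0.5)
nearest_to_first = most_similar(sim, 0, k=1)
```

In this toy setup the first two companies are close in both text and numbers, so the second company is returned as the nearest neighbor of the first. Changing `text_weight` shifts the balance between the textual and numerical views.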
