2023年度 人工知能学会全国大会(第37回)

講演情報

国際セッション

国際セッション » IS-2 Machine learning

[1U5-IS-2b] Machine learning

2023年6月6日(火) 17:00 〜 18:40 U会場 (遠隔)

Chair: Rafal Rzepka (Hokkaido university)

18:00 〜 18:20

[1U5-IS-2b-04] Improving Financial Terminologies Recognition regarding Morphological Inflection

〇Ziwei XU1, Rungsiman Nararatwong1, Natthawut Kertkeidkachorn3, Ryutaro Ichise 2,1 (1. National Institute of Advanced Industrial Science and Technology,Japan, 2. Tokyo Institute of Technology, 3. Japan Advanced Institute of Science and Technology)

[[Online, Regular]]

キーワード:transformer, morphology, finance

Recognizing financial terminologies from text is essential for key information retrieval and content understanding. In general, financial terminologies do not appear in single-token form but are composed of several tokens. Also, in terminologies, a proper name might have diverse expressions, like abbreviations and morphological inflection, which sacrifice the recognition performance on recall. In this paper, along with transformer-based language models, i.e. XLM-Roberta, we propose a mechanism to train the neural classifier to distinguish terminologies from plain text, by learning from the sequential tags of targeting tokens. Initially, the targeting tokens are from a list of terminologies. To involve the diverse expressions, we inventively generate different morphologies of terminologies and utilize them to extend the targeting tokens. The experiments' results prove that this mechanism shows a convincing improvement in identifying financial terms from plain text.

講演PDFパスワード認証
論文PDFの閲覧にはログインが必要です。参加登録者の方は「参加者用ログイン」画面からログインしてください。あるいは論文PDF閲覧用のパスワードを以下にご入力ください。

パスワード