JSAI2023

Presentation information

International Session

[1U5-IS-2b] Machine learning

Tue. Jun 6, 2023 5:00 PM - 6:40 PM Room U (Online)

Chair: Rafal Rzepka (Hokkaido University)

6:00 PM - 6:20 PM

[1U5-IS-2b-04] Improving Financial Terminologies Recognition regarding Morphological Inflection

〇Ziwei XU1, Rungsiman Nararatwong1, Natthawut Kertkeidkachorn3, Ryutaro Ichise2,1 (1. National Institute of Advanced Industrial Science and Technology, Japan, 2. Tokyo Institute of Technology, 3. Japan Advanced Institute of Science and Technology)

[Online, Regular]

Keywords: transformer, morphology, finance

Recognizing financial terminology in text is essential for key information retrieval and content understanding. Financial terms generally do not appear as single tokens but are composed of several tokens. Moreover, a proper name within a term may take diverse surface forms, such as abbreviations and morphological inflections, which lowers recognition recall. In this paper, building on transformer-based language models, i.e., XLM-RoBERTa, we propose a mechanism that trains a neural classifier to distinguish terminology from plain text by learning from the sequential tags of target tokens. Initially, the target tokens come from a list of terms. To cover the diverse expressions, we generate different morphological variants of the terms and use them to extend the set of target tokens. The experimental results show that this mechanism yields a convincing improvement in identifying financial terms in plain text.
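
The data-preparation idea described in the abstract (tagging tokens against a term list, then extending that list with morphological variants) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the BIO tag scheme, the naive pluralization rule, the function names inflect, expand_terms and bio_tags, and the example sentence. The XLM-RoBERTa classifier that would be trained on such tags is not shown.

```python
def inflect(term):
    """Return the term plus a naive plural of its last word.

    Hypothetical rule for illustration only; the abstract does not say
    how morphological variants are generated.
    """
    words = term.split()
    last = words[-1]
    if last.endswith(("s", "x", "z", "ch", "sh")):
        plural = last + "es"
    elif last.endswith("y") and len(last) > 1 and last[-2] not in "aeiou":
        plural = last[:-1] + "ies"
    else:
        plural = last + "s"
    return {term, " ".join(words[:-1] + [plural])}


def expand_terms(terms):
    """Extend a term list with the inflected variant of every term."""
    expanded = set()
    for term in terms:
        expanded |= inflect(term.lower())
    return expanded


def bio_tags(tokens, terms):
    """Assign B-TERM / I-TERM / O tags by longest-match lookup in the term set."""
    # Try longer terms first so multi-token terms win over their prefixes.
    term_seqs = sorted((t.split() for t in terms), key=len, reverse=True)
    lowered = [tok.lower() for tok in tokens]
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        for seq in term_seqs:
            n = len(seq)
            if lowered[i:i + n] == seq:
                tags[i] = "B-TERM"
                for j in range(i + 1, i + n):
                    tags[j] = "I-TERM"
                i += n
                break
        else:
            # No term starts at this position; move on to the next token.
            i += 1
    return tags


# Assumed example data, not taken from the paper.
tokens = "The fund holds convertible bonds and a credit default swap".split()
base_terms = {"convertible bond", "credit default swap"}

tags_base = bio_tags(tokens, base_terms)
tags_expanded = bio_tags(tokens, expand_terms(base_terms))
```

In this toy case the original list misses the inflected form "convertible bonds", while the expanded list tags it, which mirrors the recall problem the abstract describes. Tag sequences of this kind could then serve as training labels for a token classifier.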
