JSAI2023

Presentation information

Poster Session

General Session » Poster session

[4Xin1] Poster session 2

Fri. Jun 9, 2023 9:00 AM - 10:40 AM Room X (Exhibition hall B)

[4Xin1-03] Named entity recognition for corporate names in news text data and identification of the name by characteristics of neighboring words

〇Koutarou Tamura1, Akira Kitauchi2, Atsushi Takayama1,2 (1.Uzabase,Inc., 2.NewsPicks, Inc.)

Keywords:Natural Language Processing, Named-entity recognition, AI

We used huge news data distributed by SPEEDA service, which include not only news on general interest but also business and industry-specific topics, and built a model to extract corporate information appearing in the text as named entity. In this study, we proposed a method to extract the corporate names in the text after segmented by tokenizers and the extracted were matched with a corporate name dictionary added with their automatically-generated abbreviations and so on. Thereby, we succeed in extracting a named entity which is identified as a corporate name and the method improved the accuracy of the task of extracting corporate names and identification of the company.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password