3:00 PM - 3:20 PM
[4Q3-GS-9-04] Information Extraction from Financial Documents by Syntactic Parsing and Table Analysis
Keywords:Natural Language processing, Information Extraction, Finance
In the financial domain, when an investor makes an investment decision, he/she reads the necessary information from documents disclosed in accordance with the issuance of share certificates and corporate bonds. The financial disclosure documents are published in XBRL format and consists of a plurality of text blocks and tables, where the necessary information are scattered in the form of natural language. Extracting the information from disclosure documents and managing it continuously with DB is desirable.However, the cost is expensive to extract by hand because of the large number of the documents consisting of about 40 to 60 items to be required. In this manuscript, we apply the natural language processing techniques to the disclosure document and report the result of the extraction of the necessary information by pattern matching of syntax tree and table analysis.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.