JSAI2022

Presentation information

Organized Session

Organized Session » OS-13

[1J4-OS-13a] 医療言語処理の拡張と連携(1/2)

Tue. Jun 14, 2022 2:20 PM - 4:00 PM Room J (Room J)

オーガナイザ:矢田 竣太郎(奈良先端科学技術大学院大学)[現地]、荒牧 英治(奈良先端科学技術大学院大学)、河添 悦昌(東京大学)

3:00 PM - 3:20 PM

[1J4-OS-13a-03] Information Extraction from Japanese Case Report Corpus for Structuring Clinical Texts

Daisaku Shibata1, 〇Yoshimasa Kawazoe1, Emiko Shinohara1, Kiminori Shimamoto1 (1. Graduate School of Medicine, The University of Tokyo)

Keywords:Information Extraction, Natural Language Processing, Case Reports

[Background] Significant information related to symptoms and findings of the patients is often written in a free-text form in clinical texts. To utilize these texts, information extraction using Natural Language Processing is required. [Objective] In this study, we evaluated named entity recognition (NER) and relation extraction (RE) performances with machine learning methods. We utilized the Japanese Case report corpus, which has manually annotated 70 type of entities and 35 type of relations. [Method] This study utilized the aforementioned corpus containing 183 cases. Having pre-processed them, we finally used 182 cases consisting of 2,172 sentences. Furthermore, a machine learning model based on Bidirectional Encoder Representations from Transformers was used. [Result] The results revealed that the maximum micro-averaged F1 scores of NER and RE were 0.931 and 0.826, respectively. [Discussion] We obtained comparable results to previous studies. Hence, these results could be substantial accuracies as baselines.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password