The 81st JSAP Autumn Meeting, 2020

Presentation information

Oral presentation


[9p-Z09-1~18] 23.1 Joint Session N "Informatics"

Wed. Sep 9, 2020 1:00 PM - 6:00 PM Z09

Kiyou Shibata (the University of Tokyo), Masato Kotsugi (Tokyo Univ. of Sci.), Shigetaka Tomiya (SONY Corp.)

5:00 PM - 5:15 PM

[9p-Z09-15] Supercuration: A machine-assisted data curation tool for rapid database construction for materials informatics

Luca Foppiano1, Masashi Ishii1 (1. Material Database Group, MaDIS, NIMS)

Keywords: text mining, materials informatics, superconductors

Materials Informatics (MI) requires established Text and Data Mining (TDM) processes to accelerate progress toward data-driven materials discovery. Although considerable effort has gone into establishing them, fully automatic TDM processes remain challenging. The National Institute for Materials Science (NIMS) is constructing several databases for MI, and SuperCon is a promising data source in the superconductor domain. While implementing an automatic TDM system for extracting materials and related properties from scientific literature, we developed Supercuration, a machine-assisted curation tool for accelerating the manual data curation process in SuperCon. Supercuration interfaces with the automatic TDM system and allows users to visualise and correct the data extracted from a document. Output data (material-property pairs) are aggregated in an editable table, and the originally extracted entities are visualised as markers on a layer on top of the original document.
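As a rough illustration of the data flow described above, the following Python sketch shows one way extracted entities could be kept as positioned markers while material-property pairs are aggregated into editable rows. It is not taken from Supercuration itself: the names `Entity`, `Row`, `build_table` and `correct`, the field layout, and the marker coordinates are all hypothetical, chosen only to make the idea concrete.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Entity:
    """One extracted span; all field names are illustrative assumptions."""
    id: int
    kind: str    # "material" or "property"
    text: str    # surface text as extracted
    page: int    # page on which the marker would be drawn
    box: tuple   # (x, y, width, height) of the marker; values are made up


@dataclass(frozen=True)
class Row:
    """One editable table row linking a material to a property value."""
    material: str
    value: str
    source_ids: tuple  # ids of the entities (markers) behind this row


def build_table(entities, links):
    """Aggregate linked (material_id, property_id) pairs into table rows."""
    by_id = {e.id: e for e in entities}
    return [Row(by_id[m].text, by_id[p].text, (m, p)) for m, p in links]


def correct(table, index, **changes):
    """Return a new table with one row edited; the entities stay untouched."""
    return [replace(r, **changes) if i == index else r
            for i, r in enumerate(table)]


# Hypothetical extraction result for a single sentence of a paper.
entities = [
    Entity(1, "material", "MgB2", 1, (72, 540, 30, 10)),
    Entity(2, "property", "39 K", 1, (140, 540, 24, 10)),
]
table = build_table(entities, [(1, 2)])

# A curator's correction changes the row but keeps the link to the markers.
fixed = correct(table, 0, value="39.0 K")
```

Keeping the original entities immutable and storing only their ids in each row is one possible design: a correction never overwrites what the automatic system extracted, so the markers on the document layer can still show the original extraction next to the curated value.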