The 68th JSAP Spring Meeting 2021

Presentation information

Oral presentation

23 Joint Session N "Informatics" » 23.1 Joint Session N "Informatics"

[19p-Z32-1~15] 23.1 Joint Session N "Informatics"

Fri. Mar 19, 2021 1:30 PM - 5:45 PM Z32 (Z32)

Tetsuhiko Miyadera(AIST), Tatsuya Yokoi(Nagoya Univ.), Yukinori Koyama(NIMS)

4:45 PM - 5:00 PM

[19p-Z32-12] Machine readable extraction of chemically modified materials name

Luca Foppiano1, Sae Dieb1, Pedro Baptista de Castro2, Yan Meng2, Kensei Terashima2, Yoshihiko Takano2, Ishii Masashi1 (1.MDG, MaDIS, NIMS, 2.NFSMG, MANA, NIMS)

Keywords:machine learning, material parser, text mining

We present a machine-learning based material name parser for unstructured text. The parser implements a Conditional Random Field (CRF) model that segments the raw material string in seven component: name (Metal diboride, hydrogen, etc.), chemical formula (La Fe O7, SiH 4, etc.), doping ratio (Zn-doped, pure, etc.), stochiometric variable names and values (x = 1, 2; y = 3), and shape (thin film, powder, etc.). We constructed the training data of 3000 material names, using all the material entities from the SuperMat dataset.