JSAI2020

Presentation information

General Session

General Session » J-2 Machine learning

[2I1-GS-2] Machine learning: Random forest

Wed. Jun 10, 2020 9:00 AM - 10:40 AM Room I (jsai2020online-9)

座長:小山田昌史(NEC)

10:00 AM - 10:20 AM

[2I1-GS-2-04] Machine Learning based Prediction of DPC from Discharge Summaries

〇Tomohiro Kimura1, Shusaku Tsumoto2, Shoji Hirano2 (1. Division of Medical Service, Faculty of Medicine, Shimane University, 2. Department of Medical Informatics, Faculty of Medicine, Shimane University)

Keywords:Text mining, Random Forest, Deep Learning, Discharge Summary

This paper proposes a method for construction of classifiers for discharge summaries, composed of the following five steps First, morphological analysis is applied to a set of summaries and a term matrix is generated. Second, correspondence analysis is applied to the classification labels and the term matrix and generates two dimensional coordinates for all the terms and labels. Third, by measuring the distances between categories and the terms, ranking of key words is generated. Fourthly, keywords are selected as attributes according to the ranks, and training examples for classifiers will be generated. Finally, machine learning methods are applied to the training examples. Experimental validation shows that random forest achieved the best performance and the second best was the deep learners, but decision tree methods with many keywords performed only a little worse than neural network or deep learning methods.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password