Interpretable two-step prediction method and its application to agricultural and medical data

Jianqiang Sun

9:40 AM - 10:00 AM

[3I1-GS-13-03] Interpretable two-step prediction method and its application to agricultural and medical data

〇Jianqiang Sun¹, Aika Terada², Eri Yamasaki³, Kentaro K Shimizu^3,4, Jun Sese² (1. National Agriculture and Food Research Organization, 2. Humanome Lab., Inc., 3. University of Zurich, 4. Yokohama City University)

Keywords:regression analysis, bioinformatics, medical artificial intelligence, agriculture

Machine learning is expected to be applied in life science. However, data structures observed in life science fields tend to be high dimensions with relatively small sample sizes, which cause model overfitting and unexplainability. A sparse modeling algorithm LASSO is one of the possibilities to overcome the problems; nevertheless, the prediction performance of LASSO is usually worse than general algorithms such as neural networks. To achieve both performance and explainability, we proposed a two-phase prediction method. In this method, instead of predicting labels from features directly, we independently build two models: one predicts intermediate status from features, and the other predicts labels from the intermediate status; then combine the two models. Features of molecular biology such as gene expression are recommended to use as intermediate status. To evaluate our method, we applied the method to ecological data for flowering prediction and medical data for cancer type prediction. The results of both applications indicated that our approach ensures performance and explainability.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Presentation information

[3I1-GS-13] AI application: Social application (2)

[3I1-GS-13-03] Interpretable two-step prediction method and its application to agricultural and medical data

Password