JSAI2025

Presentation information

International Session

International Session » IS-2 Machine learning

[3K5-IS-2b] Machine learning

Thu. May 29, 2025 3:40 PM - 5:20 PM Room K (Room 1006)

Chair: 矢田 勝俊

3:40 PM - 4:00 PM

[3K5-IS-2b-01] Loan Default Prediction via Discrepancies Between Textual Narratives and Structured Data in Peer-to-Peer Lending

〇Lingxiao Dan1, Ryutaro Ichise1 (1. Institute of Science Tokyo)

Keywords:Credit Risk, P2P Lending, Discrepancy, Deep Learning

Predicting loan defaults is an important task in peer-to-peer (P2P) lending platforms to mitigate financial risks and ensure platform sustainability. Traditional methods rely heavily on structured data such as credit scores, income, and loan amounts. While recent studies also have considered integration of textual data from applications, they overlook the latent link between the textual narratives and structured data. This study focuses on the discrepancy—the misalignment between a borrower's actual financial state (captured by structured data) and their self-reported or perceived circumstances (reflected in textual narratives). Borrowers exhibiting poor self-awareness, overconfidence, or dishonesty are more likely to create such discrepancies, making them prone to default. This study proposes a novel framework to extract and quantify these discrepancies by analyzing borrower-submitted narratives alongside structured financial data. Our framework demonstrates that incorporating text-data discrepancies enhances AUC performance, providing a new perspective for risk assessment. Our model achieves performance comparable to traditional soft-information extraction methods while delivering superior interpretability through its explainable, discrepancy-driven framework.

Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password