JSAI2023

Presentation information

General Session

General Session » GS-7 Vision, speech media processing

[1O3-GS-7] Vision, speech media processing

Tue. Jun 6, 2023 1:00 PM - 2:40 PM Room O (E1+E2)

座長:田崎 豪(名城大学) [オンライン]

2:20 PM - 2:40 PM

[1O3-GS-7-05] Depression score estimating method using acoustic features of speech utterances

〇Koki Mori1, Kazuya Mera1, Yoshiaki Kurosawa1, Toshiyuki Takezawa1 (1. Hiroshima City University)

Keywords:Voice Analysis, Beck Depression Inventory, Machine Learning

In this paper, we propose a method to estimate speaker's depression score using acoustic features of his/her speech. 150 speech utterances that 15 subjects read 10 types of sentences were recorded as training data, and the depression scores of the subjects were calculated by Beck Depression Inventory (BDI) just after the recording. Acoustic features are calculated by using openSMILE or Surfboard, and Support Vector Regression or LightGBM are used for machine learning procedure. The experimental results showed that the estimated depression scores obtained a correlate efficient of 0.932 with the correct answer.

Authentication for paper PDF access

A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.

Password