Exploring digital speech biomarkers of hypokinetic dysarthria in a multilingual cohort

Varování

Publikace nespadá pod Pedagogickou fakultu, ale pod Středoevropský technologický institut. Oficiální stránka publikace je na webu muni.cz.

Autoři	KOVÁČ Daniel MEKYSKA Jiří AHARONSON Vered HARAR Pavol GALAZ Zoltan RAPCSAK Steven OROZCO-ARROYAVE Juan Rafael BRABENEC Luboš REKTOROVÁ Irena
Rok publikování	2024
Druh	Článek v odborném periodiku
Časopis / Zdroj	Biomedical Signal Processing and Control
Fakulta / Pracoviště MU	Středoevropský technologický institut
Citace
www	https://www.sciencedirect.com/science/article/pii/S174680942301100X?via%3Dihub
Doi	http://dx.doi.org/10.1016/j.bspc.2023.105667
Klíčová slova	Hypokinetic dysarthria; Parkinson's disease; Multilingual study; Acoustic speech features; Statistical analysis; Machine learning
Přiložené soubory	Exploring_digital_speech_biomarkers_of_hypokinetic_dysarthria_in_a_multilingual_cohort.pdf
Popis	Hypokinetic dysarthria, a motor speech disorder characterized by reduced movement and control in the speech -related muscles, is mostly associated with Parkinson's disease. Acoustic speech features thus offer the potential for early digital biomarkers to diagnose and monitor the progression of this disease. However, the influence of language on the successful classification of healthy and dysarthric speech remains crucial. This paper explores the analysis of acoustic speech features, both established and newly proposed, in a multilingual context to support the diagnosis of PD. The study aims to identify language-independent and highly discriminative digital speech biomarkers using statistical analysis and machine learning techniques. The study analyzes thirty-three acoustic features extracted from Czech, American, Israeli, Columbian, and Italian PD patients, as well as healthy controls. The analysis employs correlation and statistical tests, descriptive statistics, and the XGBoost classifier. Feature importances and Shapley values are used to provide explanations for the classification results. The study reveals that the most discriminative features, with reduced language dependence, are those measuring the prominence of the second formant, monopitch, and the frequency of pauses during text reading. Classification accuracies range from 67% to 85%, depending on the language. This paper introduces the concept of language robustness as a desirable quality in digital speech biomarkers, ensuring consistent behaviour across languages. By leveraging this concept and employing additional metrics, the study proposes several language-independent digital speech biomarkers with high discrimination power for diagnosing PD.
Související projekty:	Novel Network-Based Approaches for Studying Cognitive Dysfunction in Behavioral Neurology Diagnostika onemocnění s Lewyho tělísky v prodromálním stadiu založená na analýze multimodálních dat Národní ústav pro neurologický výzkum