Exploring Speech Biosignatures for Traumatic Brain Injury and Neurodegeneration: A Pilot Machine Learning Study

Rubaiat, Rahmina; Templeton, John Michael; Schneider, Sandra L; Silva, Upeka De; Madanian, Samaneh; Poellabauer, Christian

doi:10.2196/64624

Exploring Speech Biosignatures for Traumatic Brain Injury and Neurodegeneration: A Pilot Machine Learning Study

Files

Journal article

Size: 755.79 KB, File format: Adobe PDF

Date

2024-07-23

Authors

Rubaiat, Rahmina

Templeton, John Michael

Schneider, Sandra L

Silva, Upeka De

Madanian, Samaneh

Poellabauer, Christian

Item type

Journal Article

Publisher

JMIR Publications

Abstract

Background: Speech features are increasingly linked to neurodegenerative and mental health conditions, offering the potential for early detection and differentiation between disorders. As interest in speech analysis grows, distinguishing between conditions becomes critical for reliable diagnosis and assessment. Objective: This pilot study explores speech biosignatures in two distinct neurodegenerative conditions: (1) mild traumatic brain injuries (eg, concussions) and (2) Parkinson disease (PD) as the neurodegenerative condition. Methods: The study included speech samples from 235 participants (97 concussed and 94 age-matched healthy controls, 29 PD and 15 healthy controls) for the PaTaKa test and 239 participants (91 concussed and 104 healthy controls, 29 PD and 15 healthy controls) for the Sustained Vowel (/ah/) test. Age-matched healthy controls were used. Young age-matched controls were used for concussion and respective age-matched controls for neurodegenerative participants (15 healthy samples for both tests). Data augmentation with noise was applied to balance small datasets for neurodegenerative and healthy controls. Machine learning models (support vector machine, decision tree, random forest, and Extreme Gradient Boosting) were employed using 37 temporal and spectral speech features. A 5-fold stratified cross-validation was used to evaluate classification performance. Results: For the PaTaKa test, classifiers performed well, achieving F1-scores above 0.9 for concussed versus healthy and concussed versus neurodegenerative classifications across all models. Initial tests using the original dataset for neurodegenerative versus healthy classification yielded very poor results, with F1-scores below 0.2 and accuracy under 30% (eg, below 12 out of 44 correctly classified samples) across all models. This underscored the need for data augmentation, which significantly improved performance to 60%‐70% (eg, 26‐31 out of 44 samples) accuracy. In contrast, the Sustained Vowel test showed mixed results; F1-scores remained high (more than 0.85 across all models) for concussed versus neurodegenerative classifications but were significantly lower for concussed versus healthy (0.59‐0.62) and neurodegenerative versus healthy (0.33‐0.77), depending on the model. Conclusions: This study highlights the potential of speech features as biomarkers for neurodegenerative conditions. The PaTaKa test exhibited strong discriminative ability, especially for concussed versus neurodegenerative and concussed versus healthy tasks, whereas challenges remain for neurodegenerative versus healthy classification. These findings emphasize the need for further exploration of speech-based tools for differential diagnosis and early identification in neurodegenerative health.

Keywords

46 Information and Computing Sciences, 4611 Machine Learning, 52 Psychology, Machine Learning and Artificial Intelligence, Traumatic Head and Spine Injury, Neurodegenerative, Brain Disorders, Physical Injury - Accidents and Adverse Effects, Neurosciences, Traumatic Brain Injury (TBI), Neurological, 3 Good Health and Well Being

Source

JMIR Neurotechnology, ISSN: 2817-092X (Print); 2817-092X (Online), JMIR Publications, 4. doi: 10.2196/64624

DOI

10.2196/64624

Publisher's version

https://neuro.jmir.org/2025/1/e64624

Rights statement

© Rahmina Rubaiat, John Michael Templeton, Sandra L Schneider, Upeka De Silva, Samaneh Madanian, Christian Poellabauer. Originally published in JMIR Neurotechnology (https://neuro.jmir.org), 12.2.2025. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Neurotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://neuro.jmir.org, as well as this copyright and license information must be included.

Permanent link

http://hdl.handle.net/10292/18670

Collections

School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
The Brain Health Research Institute

Creative Commons license

© Rahmina Rubaiat, John Michael Templeton, Sandra L Schneider, Upeka De Silva, Samaneh Madanian, Christian Poellabauer. Originally published in JMIR Neurotechnology (https://neuro.jmir.org), 12.2.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Neurotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://neuro.jmir.org, as well as this copyright and license information must be included.

Except where otherwise noted, this item's license is described as © Rahmina Rubaiat, John Michael Templeton, Sandra L Schneider, Upeka De Silva, Samaneh Madanian, Christian Poellabauer. Originally published in JMIR Neurotechnology (https://neuro.jmir.org), 12.2.2025. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Neurotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://neuro.jmir.org, as well as this copyright and license information must be included.

Full item page

Exploring Speech Biosignatures for Traumatic Brain Injury and Neurodegeneration: A Pilot Machine Learning Study

Files

Date

Authors

Supervisor

Item type

Degree name

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Source

DOI

Publisher's version

Rights statement

Permanent link

Collections

Endorsement

Review

Supplemented By

Referenced By

Creative Commons license