Abstract
Early and adequate diagnosis of speech disorders can contribute to the quality of the treatment and thus to treatment success rates. Using acoustic analysis of the speech of children with speech disorders may aid therapists in the diagnostic process by identifying the acoustic characteristics that are unique to a specific disorder and that distinguish it from normal speech development. The purpose of this work is to investigate the feasibility of the automatic detection of speech disorders based on children’s voices. In this preliminary study, using a dataset of utterance recordings of 24 children whose mother tongue is Hebrew, we propose an automatic system that may facilitate accurate speech assessment by therapists by providing a preliminary diagnosis and explainable insights about the model’s predictions. We built a serial, two-step network that is both powerful and possibly interpretable. The first step can model the complex relations between acoustic features and the speech disorder while the second can shed light on the utterances that make the greatest contribution to the final classification. Our preliminary results focus on the broad spectrum of speech disorders. In future work, we plan to design a system that will be able to detect childhood apraxia of speech (CAS) specifically and shed light on the differences in the speech of individuals with CAS and those with other speech disorders.
Original language | English |
---|---|
Title of host publication | Speech and Computer - 22nd International Conference, SPECOM 2020, Proceedings |
Editors | Alexey Karpov, Rodmonga Potapova |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 509-519 |
Number of pages | 11 |
ISBN (Print) | 9783030602758 |
DOIs | |
State | Published - 2020 |
Event | 22nd International Conference on Speech and Computer, SPECOM 2020 - St. Petersburg, Russian Federation Duration: 7 Oct 2020 → 9 Oct 2020 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12335 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 22nd International Conference on Speech and Computer, SPECOM 2020 |
---|---|
Country/Territory | Russian Federation |
City | St. Petersburg |
Period | 7/10/20 → 9/10/20 |
Bibliographical note
Publisher Copyright:© 2020, Springer Nature Switzerland AG.
Keywords
- Childhood Apraxia of Speech (CAS)
- Deep spectrum
- GeMAPS
- Speech disorder