Prosodic feature extraction for assessment and treatment of dysarthria

Ijitona, Tolulope and Di Caterina, Gaetano and Yue, Hong and Soraghan, John (2016) Prosodic feature extraction for assessment and treatment of dysarthria. In: UK Speech Conference 2016, 2015-07-20 - 2016-06-21, University of Sheffield.

[thumbnail of Ijitona-etal-UKSC-2016-Prosodic-feature-extraction-for-assessment-and-treatment-of-dysarthria]
Text. Filename: Ijitona_etal_UKSC_2016_Prosodic_feature_extraction_for_assessment_and_treatment_of_dysarthria.pdf
Final Published Version

Download (75kB)| Preview


Dysarthria, a neurological motor speech disorder caused by lesions to the central and peripheral nervous system, accounts for over 40% of neurological disorders referred to pathologists in 2013[1]. This affects the ability of speakers to control the movement of speech production muscles due to muscle weakness. Dysarthria is characterised by reduced loudness, high pitch variability, monotonous speech, poor voice quality and reduced intelligibility [2]. Current techniques for dysarthria assessment are based on perception, which do not give objective measurements for the severity of this speech disorder. There is therefore a need to explore objective techniques for dysarthria assessment and treatment. The goal of this research is to identify and extract the main acoustic features which can be used to describe the type and severity of this disorder. An acoustic feature extraction and classification technique is proposed in this work. The proposed method involves a pre-processing stage where audio samples are filtered to remove noise and resampled at 8 kHz. The next stage is a feature extraction stage where pitch, intensity, formants, zero-crossing rate, speech rate and cepstral coefficients are extracted from the speech samples. Classification of the extracted features is carried out using a single layer neural network. After the classification, a treatment tool is to be developed to assist patients, through tailored exercises, to improve their articulatory ability, intelligibility, intonation and voice quality. Consequently, this proposed technique will assist speech therapists in tracking the progress of patients over time. It will also provide an acoustic objective measurement for dysarthria severity assessment. Some of the potential applications of this technology include management of cognitive speech impairments, treatment of speech difficulties in children and other advanced speech and language applications.


Ijitona, Tolulope ORCID logoORCID:, Di Caterina, Gaetano ORCID logoORCID:, Yue, Hong ORCID logoORCID: and Soraghan, John ORCID logoORCID:;