A deep learning method for pathological voice detection using convolutional deep belief networks

Wu, Huiyi and Soraghan, John and Lowit, Anja and Di Caterina, Gaetano (2018) A deep learning method for pathological voice detection using convolutional deep belief networks. In: Interspeech 2018, 2018-09-02 - 2018-09-06.

[thumbnail of WU-etal-Interspeech-2018-A-deep-learning-method-for-pathological-voice-detection]

Preview

Text. Filename: WU_etal_Interspeech_2018_A_deep_learning_method_for_pathological_voice_detection.pdf
Accepted Author Manuscript
Download (509kB)| Preview

Abstract

Automatically detecting pathological voice disorders such as vocal cord paralysis or Reinke’s edema is a challenging and important medical classification problem. While deep learning techniques have achieved significant progress in the speech recognition field there has been less research work in the area of pathological voice disorders detection. A novel system for pathological voice detection using convolutional neural network (CNN) as the basic architecture is presented in this work. The novel system uses spectrograms of normal and pathological speech recordings as the input to the network. Initially Convolutional deep belief network (CDBN) are used to pre-train the weights of CNN system. This acts as a generative model to explore the structure of the input data using statistical methods. Then a CNN is trained using supervised back-propagation learning algorithm to fine tune the weights. It will be shown that a small amount of data can be used to achieve good results in classification with this deep learning approach. A performance analysis of the novel method is provided using real data from the Saarbrucken Voice database

ORCID iDs

Wu, Huiyi, Soraghan, John

, Lowit, Anja

and Di Caterina, Gaetano

;

Share and Export

Item metadata

Item type:	Conference or Workshop Item(Paper)
ID code:	64290
Dates:	Date Event 2 September 2018 Published 3 June 2018 Accepted
Subjects:	Technology > Electrical engineering. Electronics Nuclear engineering
Department:	Faculty of Engineering > Electronic and Electrical Engineering Technology and Innovation Centre > Sensors and Asset Management Faculty of Humanities and Social Sciences (HaSS) > Psychological Sciences and Health > Speech and Language Therapy
Depositing user:	Pure Administrator
Date deposited:	07 Jun 2018 08:45
Last modified:	18 Oct 2024 00:54
Related URLs:	Event
URI:	https://strathprints.strath.ac.uk/id/eprint/64290

CORE (COnnecting REpositories)