A semi-automatic pipeline for transcribing and segmenting child speech
Christodoulidou, Polychronia and Tanner, James and Stuart-Smith, Jane and McAuliffe, Michael and Murali, Mridhula and Smith, Amy and Taylor, Lauren and Cleland, Joanne and Kuschmann, Anja; (2025) A semi-automatic pipeline for transcribing and segmenting child speech. In: Proceedings of Interspeech 2025. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH . ISCA Archive, NLD, pp. 4278-4282. (https://www.isca-archive.org/interspeech_2025/chri...)
Preview |
Text.
Filename: Christodoulidou-etal-Interspeech-2025-A-semi-automatic-pipeline-for-transcribing-and-segmenting-child-speech.pdf
Final Published Version License: Other Download (558kB)| Preview |
Abstract
This study evaluates both automated transcription (WhisperX) and forced alignment (MFA) in developing a semi-automated pipeline for obtaining acoustic vowel measures from field recordings from 275 children speaking a non-standard, English dialect, Scottish English. As expected, manual correction of speech transcriptions before forced alignment improves the quality of acoustic vowel measures with respect to manually-annotated data, though speech style and recording environment present some challenges for both tools. Adaptation of the MFA pre-trained english_us_arpa acoustic model towards the children's speech also improves the quality of acoustic measures, though greater improvement was not found by increasing training sample size.
ORCID iDs
Christodoulidou, Polychronia, Tanner, James, Stuart-Smith, Jane, McAuliffe, Michael, Murali, Mridhula
ORCID: https://orcid.org/0000-0001-5450-6419, Smith, Amy
ORCID: https://orcid.org/0009-0001-6303-0691, Taylor, Lauren, Cleland, Joanne
ORCID: https://orcid.org/0000-0002-0660-1646 and Kuschmann, Anja
ORCID: https://orcid.org/0000-0001-5396-9008;
-
-
Item type: Book Section ID code: 93093 Dates: DateEvent14 August 2025Published19 May 2025AcceptedSubjects: Medicine > Internal medicine > Neuroscience. Biological psychiatry. Neuropsychiatry > Communicative disorders. Speech and language disorders Department: Faculty of Humanities and Social Sciences (HaSS) > Psychological Sciences and Health > Speech and Language Therapy
Strategic Research Themes > Health and WellbeingDepositing user: Pure Administrator Date deposited: 12 Jun 2025 09:48 Last modified: 08 Jan 2026 20:59 Related URLs: URI: https://strathprints.strath.ac.uk/id/eprint/93093
Tools
Tools





