Evaluating synthetic speech workload with oculo-motor indices : preliminary observations for Japanese speech

Dubiel, Mateusz and Nakayama, Minoru and Wang, Xin (2021) Evaluating synthetic speech workload with oculo-motor indices : preliminary observations for Japanese speech. In: BIOSIGNALS 2021, 2021-02-11 - 2021-02-13, Online Event. (https://doi.org/10.5220/0010341303350342)

[thumbnail of Dubiel-etal-Biosignal-2021-Evaluating-synthetic-speech-workload-with-oculo-motor-indices]
Preview
Text. Filename: Dubiel_etal_Biosignal_2021_Evaluating_synthetic_speech_workload_with_oculo_motor_indices.pdf
Accepted Author Manuscript

Download (539kB)| Preview

Abstract

Pupillometry has recently been introduced as a method to evaluate cognitive workload of synthetic speech. Prior research conducted on English speech indicates that in noisy listening conditions, pupil dilation is significantly higher for synthetic speech as compared to natural speech. In a lab-based listening experiment, we evaluated participants' (n=16) pupil responses to Japanese speech (natural vs. synthetic) at three different signal-to-noise levels (-1dB, -3dB and -5dB). Our research expands on previous work by evaluating pupillary responses both in terms of temporal changes in pupil size and degree of pupil oscillations. We observe statistically significant differences in pupil sizes at the recall stage between each type of speech. For pupil oscillations, we register statistically significant differences in frequency power spectrum densities (PSDs). Our investigation proposes an expansion of the current synthetic speech evaluation methods that are based on pupillary responses and outlines possible avenues for future research that arise from the findings of this work.