The role of valence, dominance, and pitch in social perceptions of artificial intelligence (AI) conversational agents' voices

Shiramizu, Victor Kenji M. and Lee, Anthony J. and Altenburg, Daria and Feinberg, David R. and Jones, Benedict C. (2022) The role of valence, dominance, and pitch in social perceptions of artificial intelligence (AI) conversational agents' voices. Scientific Reports, 12 (1). 22479. ISSN 2045-2322 (https://doi.org/10.1038/s41598-022-27124-8)

Text. Filename: Shiramizu_etal_SR_2022_The_role_of_valence_dominance_and_pitch_in_social_perceptions_of_artificial_intelligence.pdf
Final Published Version
License: Creative Commons Attribution 4.0

Text. Filename: s41598_022_27124_8.pdf
Final Published Version
License: Creative Commons Attribution 4.0

Abstract

There is growing concern that artificial intelligence conversational agents (e.g., Siri, Alexa) reinforce voice-based social stereotypes. Because little is known about social perceptions of conversational agents' voices, we investigated (1) the dimensions that underpin perceptions of these synthetic voices and (2) the role that acoustic parameters play in these perceptions. Study 1 (N = 504) found that perceptions of synthetic voices are underpinned by Valence and Dominance components similar to those previously reported for natural human stimuli and that the Dominance component was strongly and negatively related to voice pitch. Study 2 (N = 160) found that experimentally manipulating pitch in synthetic voices directly influenced dominance-related, but not valence-related, perceptions. Collectively, these results suggest that greater consideration of the role that voice pitch plays in dominance-related perceptions when designing conversational agents may be an effective method for controlling stereotypic perceptions of their voices and the downstream consequences of those perceptions.