Sign language recognition using spiking neural networks

Chaudhari, Pranav and Vicente Sola, Alex and Basu, Amlan and Manna, Davide L. and Kirkland, Paul and Di Caterina, Gaetano (2023) Sign language recognition using spiking neural networks. Procedia Computer Science. pp. 1-10. ISSN 1877-0509 (In Press)

[thumbnail of Chaudhari-etal-PSC-2023-Sign-language-recognition-using-spiking-neural-networks] Text. Filename: Chaudhari_etal_PSC_2023_Sign_language_recognition_using_spiking_neural_networks.pdf
Accepted Author Manuscript
Restricted to Repository staff only until 1 January 2099.
License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 logo

Download (420kB) | Request a copy

Abstract

In recent years, research in automatic Sign Language Recognition (SLR) has undergone significant progress, serving as a foundational base for developing applications that aim to promote the integration of deaf individuals into society. Most of this progress is owed to the recent developments in deep learning. However, the deployment of conventional Artificial Neural Networks (ANNs) can be hindered by their requirements in terms of computational power and energy consumption. Therefore, to improve the efficiency of current SLR systems, in this work, we propose the use of the increasingly popular Spiking Neural Networks (SNNs), which, on the one hand, provide more energy-efficient computations than conventional ANNs and, on the other hand, are able to process temporal sequences with simpler architectures thanks to their temporal dynamics. To evaluate our method, we utilize WLASL300, the 300-word (300 classes of signs) dataset fromWord-Level American Sign Language, and achieve an improvement in accuracy with the SNN (+2.70%) over the previous state-of-the-art, when working with energy-efficient spiking neurons. Furthermore, we construct a non-spiking version of the same network and evaluate it in a similar manner. Our results demonstrate how the SNN has sparser activations (25% less), thanks to the use of spiking neurons, and therefore can be implemented with a lower power requirement than an ANN version of the same architecture. This work thus demonstrates the possibility of performing SLR in a very effective and efficient way, thus opening up the development of applications that span from the automatic real-time translation of dynamic signs to remote control utilizing sign languages.