Dataset and Evaluation of Automatic Speech Recognition for Multi-lingual Intent Recognition on Social Robots

Antonio Andriella, Raquel Ros, Yoav Ellinson, Sharon Gannot, Séverin Lemaignan

Producción científica: Capítulo del libroContribución a congreso/conferenciarevisión exhaustiva

Resumen

While Automatic Speech Recognition (ASR) systems excel in controlled environments, challenges arise in robot-specific setups due to unique microphone requirements and added noise sources. In this paper, we create a dataset of initiating conversations with brief exchanges in 5 European languages, and we systematically evaluate current state-of-art ASR systems (Vosk, OpenWhisper, Google Speech and NVidia Riva). Besides standard metrics, we also look at two critical downstream tasks for human-robot verbal interaction: intent recognition rate and entity extraction, using the open-source Rasa chatbot. Overall, we found that open-source solutions as Vosk performs competitively with closed-source solutions while running on the edge, on a low compute budget (CPU only).

Idioma originalInglés
Título de la publicación alojadaHRI 2024 - Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction
EditorialIEEE Computer Society
Páginas865-869
Número de páginas5
ISBN (versión digital)9798400703225
DOI
EstadoPublicada - 11 mar 2024
Publicado de forma externa
Evento19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024 - Boulder, Estados Unidos
Duración: 11 mar 202415 mar 2024

Serie de la publicación

NombreACM/IEEE International Conference on Human-Robot Interaction
ISSN (versión digital)2167-2148

Conferencia

Conferencia19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024
País/TerritorioEstados Unidos
CiudadBoulder
Período11/03/2415/03/24

Huella

Profundice en los temas de investigación de 'Dataset and Evaluation of Automatic Speech Recognition for Multi-lingual Intent Recognition on Social Robots'. En conjunto forman una huella única.

Citar esto