Comparison of machine learning techniques for the automatic recognition of soundscapes

Xavier Valero, Pau Farré, Francesc Alías

Producció científica: Article en revista indexadaArticle de conferènciaAvaluat per experts

4 Cites (Scopus)


Rather than recognising single environmental sound sources, research on soundscape recognition aims at identifying general unstructured auditory scenes composed of multiple simultaneous sound sources. Nowadays, soundscape recognition has multiple applications, such as: i) contextawareness for mobile robots and portable devices, so as to provide information about the surrounding acoustic environment, enabling the automatic reaction of the device without human intervention; ii) background noise identification for speech recognition systems, so as to improve the robustness of such systems by facilitating their adaptability to any indoor or outdoor environment; iii) support tool in urban planning and noise annoyance assessment. In this paper, pattern recognition techniques specially designed to automatically recognise soundscapes are implemented. Specifically, the study focuses on finding the most appropriated learning paradigm. A set of machine learning techniques, commonly employed in similar sound recognition tasks, are considered: decision trees, K-Nearest Neighbours algorithm, Gaussian Mixture Models and Neural Networks. Extensive experiments are carried out so as to empirically test and compare the performance attained by the different learning techniques. The experiments are performed employing an audio data corpus composed of 15 different soundscenes, such as park, traffic street, restaurant or stadium.

Idioma originalAnglès
Pàgines (de-a)2037-2042
Nombre de pàgines6
RevistaProceedings of Forum Acusticum
Estat de la publicacióPublicada - 2011
Esdeveniment6th Forum Acusticum 2011 - Aalborg, Denmark
Durada: 27 de juny 20111 de jul. 2011


Navegar pels temes de recerca de 'Comparison of machine learning techniques for the automatic recognition of soundscapes'. Junts formen un fingerprint únic.

Com citar-ho