Classification of audio scenes using Narrow-Band Autocorrelation features

Xavier Valero, Francesc Aliás

Producció científica: Capítol de llibreContribució a congrés/conferènciaAvaluat per experts

7 Cites (Scopus)

Resum

Multiple single sound events of very different characteristics might coincide in a given space and time, thus composing complex audio scenes. In that context, defining signal features capable of effectively analyzing the holistic audio scenes is a challenging task. This paper introduces a set of features that consider the temporal, spectral and perceptual characteristics of the audio scene signals. Specifically, the features are obtained from the autocorrelation function of band-pass signals computed after applying a Mel filter bank. The so-called Narrow-Band Autocorrelation (NB-ACF) features are compared to state-of-the-art signal features on a corpus of 4 hours composed of 15 audio scenes. Regardless of the learning algorithm employed, the NB-ACF attains the highest averaged recognition rates: 2.3 % higher than Mel Frequency Cepstral Coefficients and 5.6 % higher than Discrete Wavelet Coefficients.

Idioma originalAnglès
Títol de la publicacióProceedings of the 20th European Signal Processing Conference, EUSIPCO 2012
Pàgines2015-2019
Nombre de pàgines5
Estat de la publicacióPublicada - 2012
Esdeveniment20th European Signal Processing Conference, EUSIPCO 2012 - Bucharest, Romania
Durada: 27 d’ag. 201231 d’ag. 2012

Sèrie de publicacions

NomEuropean Signal Processing Conference
ISSN (imprès)2219-5491

Conferència

Conferència20th European Signal Processing Conference, EUSIPCO 2012
País/TerritoriRomania
CiutatBucharest
Període27/08/1231/08/12

Fingerprint

Navegar pels temes de recerca de 'Classification of audio scenes using Narrow-Band Autocorrelation features'. Junts formen un fingerprint únic.

Com citar-ho