TY - GEN
T1 - Classification of audio scenes using Narrow-Band Autocorrelation features
AU - Valero, Xavier
AU - Aliás, Francesc
PY - 2012
Y1 - 2012
N2 - Multiple single sound events of very different characteristics might coincide in a given space and time, thus composing complex audio scenes. In that context, defining signal features capable of effectively analyzing the holistic audio scenes is a challenging task. This paper introduces a set of features that consider the temporal, spectral and perceptual characteristics of the audio scene signals. Specifically, the features are obtained from the autocorrelation function of band-pass signals computed after applying a Mel filter bank. The so-called Narrow-Band Autocorrelation (NB-ACF) features are compared to state-of-the-art signal features on a corpus of 4 hours composed of 15 audio scenes. Regardless of the learning algorithm employed, the NB-ACF attains the highest averaged recognition rates: 2.3 % higher than Mel Frequency Cepstral Coefficients and 5.6 % higher than Discrete Wavelet Coefficients.
AB - Multiple single sound events of very different characteristics might coincide in a given space and time, thus composing complex audio scenes. In that context, defining signal features capable of effectively analyzing the holistic audio scenes is a challenging task. This paper introduces a set of features that consider the temporal, spectral and perceptual characteristics of the audio scene signals. Specifically, the features are obtained from the autocorrelation function of band-pass signals computed after applying a Mel filter bank. The so-called Narrow-Band Autocorrelation (NB-ACF) features are compared to state-of-the-art signal features on a corpus of 4 hours composed of 15 audio scenes. Regardless of the learning algorithm employed, the NB-ACF attains the highest averaged recognition rates: 2.3 % higher than Mel Frequency Cepstral Coefficients and 5.6 % higher than Discrete Wavelet Coefficients.
KW - Audio classification
KW - autocorrelation function
KW - environmental sound recognition
KW - feature extraction
KW - narrow-band signal analysis
UR - http://www.scopus.com/inward/record.url?scp=84869791462&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84869791462
SN - 9781467310680
T3 - European Signal Processing Conference
SP - 2015
EP - 2019
BT - Proceedings of the 20th European Signal Processing Conference, EUSIPCO 2012
T2 - 20th European Signal Processing Conference, EUSIPCO 2012
Y2 - 27 August 2012 through 31 August 2012
ER -