TY - GEN
T1 - The DeuteroNoise Dataset
T2 - 26th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2024
AU - Nou-Plana, Ignasi
AU - Freixes, Marc
AU - Vaquerizo-Serrano, Jesús
AU - Arnela, Marc
AU - Cañestro, Cristian
AU - Quintana, Eva R.
AU - Teaca, Adrian
AU - Chatzigeorgiou, Marios
AU - Ristoratore, Filomena
AU - Zambon, Giovanni
AU - Manni, Lucia
AU - Alsina-Pagès, Rosa Ma
N1 - Publisher Copyright:
© 2024 The Authors.
PY - 2024/9/25
Y1 - 2024/9/25
N2 - The vast and largely unexplored underwater environment is a rich source of diverse sound events. These sounds, which range from the calls of marine life to the noise generated by human activities, create a complex acoustic environment. Research has been conducted on gathering and categorizing this type of data. However, only a few databases have been recently openly published focusing on anthropogenic sounds. This paper outlines the preliminary steps towards the creation of a comprehensive dataset for the detection of underwater sound events, with an initial emphasis on the sounds of vessels and boats. Within the framework of the JPI Oceans project, DeuteroNoise, a wide spectrum of vessel sounds under varying conditions has been captured and annotated with the necessary metadata for sound event detection tasks. Moreover, the proposed dataset will facilitate the development of more accurate and adaptable vessel sound event detection models and encourage further research in this area. The dataset contains raw audio files and the respective initial analysis based on labeled events, duration of the events, signal-to-noise ratio (SNR) and impact measurements. The retrieved data can be grouped into noisy and non-noisy spots. The noisy locations include the Port of Barcelona (Spain), the Port of Constanta (Romania), and the Lagoon of Venice (Italy). In contrast, non-noisy data was collected during two measurement campaigns at Pont del Petroli in Badalona (Spain). In addition to the dataset, to illustrate its potential application, a classifier for vessel/boat sound events is proposed. The classifier uses mel spectrograms as input data and is built on a pre-trained model that leverages a residual neural network. This system is capable of classifying vessel/boat related events from background sound environment.
AB - The vast and largely unexplored underwater environment is a rich source of diverse sound events. These sounds, which range from the calls of marine life to the noise generated by human activities, create a complex acoustic environment. Research has been conducted on gathering and categorizing this type of data. However, only a few databases have been recently openly published focusing on anthropogenic sounds. This paper outlines the preliminary steps towards the creation of a comprehensive dataset for the detection of underwater sound events, with an initial emphasis on the sounds of vessels and boats. Within the framework of the JPI Oceans project, DeuteroNoise, a wide spectrum of vessel sounds under varying conditions has been captured and annotated with the necessary metadata for sound event detection tasks. Moreover, the proposed dataset will facilitate the development of more accurate and adaptable vessel sound event detection models and encourage further research in this area. The dataset contains raw audio files and the respective initial analysis based on labeled events, duration of the events, signal-to-noise ratio (SNR) and impact measurements. The retrieved data can be grouped into noisy and non-noisy spots. The noisy locations include the Port of Barcelona (Spain), the Port of Constanta (Romania), and the Lagoon of Venice (Italy). In contrast, non-noisy data was collected during two measurement campaigns at Pont del Petroli in Badalona (Spain). In addition to the dataset, to illustrate its potential application, a classifier for vessel/boat sound events is proposed. The classifier uses mel spectrograms as input data and is built on a pre-trained model that leverages a residual neural network. This system is capable of classifying vessel/boat related events from background sound environment.
KW - AI Applications
KW - Data Mining and Knowledge Discovery from Databases
UR - http://www.scopus.com/inward/record.url?scp=85217012054&partnerID=8YFLogxK
U2 - 10.3233/FAIA240446
DO - 10.3233/FAIA240446
M3 - Conference contribution
AN - SCOPUS:85217012054
T3 - Frontiers in Artificial Intelligence and Applications
SP - 260
EP - 269
BT - Artificial Intelligence Research and Development - Proceedings of the 26th International Conference of the Catalan Association for Artificial Intelligence
A2 - Alsinet, Teresa
A2 - Vilasis--Cardona, Xavier
A2 - Garcia-Costa, Daniel
A2 - Alvarez-Garcia, Elena
PB - IOS Press BV
Y2 - 2 October 2024 through 4 October 2024
ER -