Resum
Analysing behavioural patterns of bird species in a certain region enables researchers to recognize forthcoming changes in environment, ecology, and population. Ornithologists spend many hours observing and recording birds in their natural habitat to compare different audio samples and extract valuable insights. This manual process is typically undertaken by highly-experienced birders that identify every species and its associated type of sound. In recent years, some public repositories hosting labelled acoustic samples from different bird species have emerged, which has resulted in appealing datasets that computer scientists can use to test the accuracy of their machine learning algorithms and assist ornithologists in the time-consuming process of analyzing audio data. Current limitations in the performance of these algorithms come from the fact that the acoustic samples of these datasets combine fragments with only environmental noise and fragments with the bird sound (i.e., the computer confuses environmental sound with the bird sound). Therefore, the purpose of this paper is to release a dataset lasting more than 4984 s that contains differentiated samples of (1) bird sounds and (2) environmental sounds. This data descriptor releases the processed audio samples—originally obtained from the Xeno-Canto repository—from the known seven families of the Picidae species inhabiting the Iberian Peninsula that are good indicators of the habitat quality and have significant value from the environment conservation point of view.
Idioma original | Anglès |
---|---|
Número d’article | 18 |
Revista | Data |
Volum | 2 |
Número | 2 |
DOIs | |
Estat de la publicació | Publicada - de juny 2017 |
Fingerprint
Navegar pels temes de recerca de 'Towards automatic bird detection: An annotated and segmented acoustic dataset of seven picidae species'. Junts formen un fingerprint únic.Com citar-ho
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver
}
In: Data, Vol. 2, Núm. 2, 18, 06.2017.
Producció científica: Article en revista indexada › Article › Avaluat per experts
TY - JOUR
T1 - Towards automatic bird detection
T2 - An annotated and segmented acoustic dataset of seven picidae species
AU - Vidaña-Vila, Ester
AU - Navarro, Joan
AU - Alsina-Pagès, Rosa Ma
N1 - Funding Information: The authors would like to thank project Xeno-Canto for their work gathering and publishing acoustic birdsong recordings. We would like to thank specially the recorders of all the birdsong data, without whom this work would not have been possible. We list all the recorders together with the Xeno-Canto code of the recording we used to generate the annotated database. Albert Lastukhin (XC267333), Alexander Kurthy (XC311605), Antero Lindholm (XC247633), Beatrix Saadi-Varchmin (XC356233), Bram Piot (XC170646), brickegickel (XC313131, XC356127), Buhl Johannes (XC177871), Concord Lexington (XC318086), Danuta Peplowska-Marcza (XC302722), Dare Sere (XC337994), Dawid Jablonski (XC176278), Dmitry Kulakov (XC309571, XC309576, XC309580), Dmitry Yakubovich (XC234609), Eetu Paljakka (XC207080, XC308855, XC308859, XC324852), Elias A. Ryberg (XC313173, XC338390), Francesco Sottile (XC307521, XC311566), Gunnar Fernqvist (XC221271), Hannu Jannes (XC214195), Hans Matheve (XC354870, XC354873), Jack Berteau (XC156677, XC156679), Jarek Matusiak (XC130202, XC153024, XC210380, XC234512), Jelmer Poelstra (XC315966), Jens Kirkeby (XC322593), Jerome Fischer (XC169322, XC187153, XC303432, XC308986), Joost van Bruggen (XC293003, XC321386), Jordi Calvet (XC306722, XC348419), José Carlos Sires (XC343460), Julien Rochefort (XC147107, XC304818), Justin Jansen (XC165371), Krzysztof Deoniziak (XC310278, XC314203, XC314369, XC314609, XC314610, XC314985), Lars Adler Krogh (XC215636), Lars Buckx (XC179826), Lars Lachmann (XC127763, XC252476, XC331306, XC331317), Lauri Hallikainen (XC233286, XC234477, XC234544), Manuel Grosselet (XC298607), Marc Anderson (XC310360), Marco Dragonetti—www.birdsongs.it—(XC331113, XC331114, XC331115, XC331116), Martin Vlk Mrnous (XC355912), Mikael Litsgard (XC237259, XC239371), Miklos Heincz (XC313584), Niels Van Doninck (XC351824), Niels Van Doninck (XC351825), Nikolay Sariev (XC239446), Nils Agster (XC288966), Pascal Christe (XC302364), Patrik Aberg (XC26678, XC293037, XC293038, XC310710, XC343373, XC343374, XC349394, XC349395), PE Svahn (XC212569), Pepe Lehikoinen (XC277517), Peter Mazuryk (XC355288), Ruud van Beusekom (XC46219), Sonnenburg (XC166681, XC234155, XC313701, XC319646, XC342794, XC355052, XC355459, XC355460, XC356033, XC356034), Stanislas Wroza (XC354361), Stein O. Nilsen (XC278527, XC278528), Szymon Plawecki (XC335265, XC335267), Terje Kolaas (XC236456, XC236458, XC236460, XC236833, XC237245, XC302820, XC324357, XC324358), Tero Linjama (XC343654, XC343664, XC343667, XC343668, XC343669, XC343676, XC343683, XC343686, XC343803, XC343805, XC343807), Thomas Luthi (XC357122), Timo Roeke (XC346551), Timo Tschentscher (XC266489), Tom Wulf (XC329937, XC329942, XC329944, XC329955), Tomek Tumiel (XC282634, XC331530), Uku Paal (XC172697, XC215916, XC219334, XC219335, XC219336), Volker Arnold (XC130274). This research has been partially funded by the Secretaria d’Universitats i Recerca del Departament d’Economia i Coneixement (Generalitat de Catalunya) under grants ref. 2014-SGR-0590 and ref. 2014-SGR-589. Funding Information: Acknowledgments: The authors would like to thank project Xeno-Canto for their work gathering and publishing acoustic birdsong recordings. We would like to thank specially the recorders of all the birdsong data, without whom this work would not have been possible. We list all the recorders together with the Xeno-Canto code of the recording we used to generate the annotated database. Albert Lastukhin (XC267333), Alexander Kurthy (XC311605), Antero Lindholm (XC247633), Beatrix Saadi-Varchmin (XC356233), Bram Piot (XC170646), brickegickel (XC313131, XC356127), Buhl Johannes (XC177871), Concord Lexington (XC318086), Danuta Peplowska-Marcza (XC302722), Dare Sere (XC337994), Dawid Jablonski (XC176278), Dmitry Kulakov (XC309571, XC309576, XC309580), Dmitry Yakubovich (XC234609), Eetu Paljakka (XC207080, XC308855, XC308859, XC324852), Elias A. Ryberg (XC313173, XC338390), Francesco Sottile (XC307521, XC311566), Gunnar Fernqvist (XC221271), Hannu Jannes (XC214195), Hans Matheve (XC354870, XC354873), Jack Berteau (XC156677, XC156679), Jarek Matusiak (XC130202, XC153024, XC210380, XC234512), Jelmer Poelstra (XC315966), Jens Kirkeby (XC322593), Jerome Fischer (XC169322, XC187153, XC303432, XC308986), Joost van Bruggen (XC293003, XC321386), Jordi Calvet (XC306722, XC348419), José Carlos Sires (XC343460), Julien Rochefort (XC147107, XC304818), Justin Jansen (XC165371), Krzysztof Deoniziak (XC310278, XC314203, XC314369, XC314609, XC314610, XC314985), Lars Adler Krogh (XC215636), Lars Buckx (XC179826), Lars Lachmann (XC127763, XC252476, XC331306, XC331317), Lauri Hallikainen (XC233286, XC234477, XC234544), Manuel Grosselet (XC298607), Marc Anderson (XC310360), Marco Dragonetti—www.birdsongs.it—(XC331113, XC331114, XC331115, XC331116), Martin Vlk Mrnous (XC355912), Mikael Litsgard (XC237259, XC239371), Miklos Heincz (XC313584), Niels Van Doninck (XC351824), Niels Van Doninck (XC351825), Nikolay Sariev (XC239446), Nils Agster (XC288966), Pascal Christe (XC302364), Patrik Aberg (XC26678, XC293037, XC293038, XC310710, XC343373, XC343374, XC349394, XC349395), PE Svahn (XC212569), Pepe Lehikoinen (XC277517), Peter Mazuryk (XC355288), Ruud van Beusekom (XC46219), Sonnenburg (XC166681, XC234155, XC313701, XC319646, XC342794, XC355052, XC355459, XC355460, XC356033, XC356034), Stanislas Wroza (XC354361), Stein O. Nilsen (XC278527, XC278528), Szymon Plawecki (XC335265, XC335267), Terje Kolaas (XC236456, XC236458, XC236460, XC236833, XC237245, XC302820, XC324357, XC324358), Tero Linjama (XC343654, XC343664, XC343667, XC343668, XC343669, XC343676, XC343683, XC343686, XC343803, XC343805, XC343807), Thomas Luthi (XC357122), Timo Roeke (XC346551), Timo Tschentscher (XC266489), Tom Wulf (XC329937, XC329942, XC329944, XC329955), Tomek Tumiel (XC282634, XC331530), Uku Paal (XC172697, XC215916, XC219334, XC219335, XC219336), Volker Arnold (XC130274). This research has been partially funded by the Secretaria d’Universitats i Recerca del Departament d’Economia i Coneixement (Generalitat de Catalunya) under grants ref. 2014-SGR-0590 and ref. 2014-SGR-589. Publisher Copyright: © 2017 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2017/6
Y1 - 2017/6
N2 - Analysing behavioural patterns of bird species in a certain region enables researchers to recognize forthcoming changes in environment, ecology, and population. Ornithologists spend many hours observing and recording birds in their natural habitat to compare different audio samples and extract valuable insights. This manual process is typically undertaken by highly-experienced birders that identify every species and its associated type of sound. In recent years, some public repositories hosting labelled acoustic samples from different bird species have emerged, which has resulted in appealing datasets that computer scientists can use to test the accuracy of their machine learning algorithms and assist ornithologists in the time-consuming process of analyzing audio data. Current limitations in the performance of these algorithms come from the fact that the acoustic samples of these datasets combine fragments with only environmental noise and fragments with the bird sound (i.e., the computer confuses environmental sound with the bird sound). Therefore, the purpose of this paper is to release a dataset lasting more than 4984 s that contains differentiated samples of (1) bird sounds and (2) environmental sounds. This data descriptor releases the processed audio samples—originally obtained from the Xeno-Canto repository—from the known seven families of the Picidae species inhabiting the Iberian Peninsula that are good indicators of the habitat quality and have significant value from the environment conservation point of view.
AB - Analysing behavioural patterns of bird species in a certain region enables researchers to recognize forthcoming changes in environment, ecology, and population. Ornithologists spend many hours observing and recording birds in their natural habitat to compare different audio samples and extract valuable insights. This manual process is typically undertaken by highly-experienced birders that identify every species and its associated type of sound. In recent years, some public repositories hosting labelled acoustic samples from different bird species have emerged, which has resulted in appealing datasets that computer scientists can use to test the accuracy of their machine learning algorithms and assist ornithologists in the time-consuming process of analyzing audio data. Current limitations in the performance of these algorithms come from the fact that the acoustic samples of these datasets combine fragments with only environmental noise and fragments with the bird sound (i.e., the computer confuses environmental sound with the bird sound). Therefore, the purpose of this paper is to release a dataset lasting more than 4984 s that contains differentiated samples of (1) bird sounds and (2) environmental sounds. This data descriptor releases the processed audio samples—originally obtained from the Xeno-Canto repository—from the known seven families of the Picidae species inhabiting the Iberian Peninsula that are good indicators of the habitat quality and have significant value from the environment conservation point of view.
KW - Acoustic bird recognition
KW - Audio sample
KW - Birdsong
KW - Birdsong dataset
KW - Dataset
UR - http://www.scopus.com/inward/record.url?scp=85052060471&partnerID=8YFLogxK
U2 - 10.3390/data2020018
DO - 10.3390/data2020018
M3 - Article
AN - SCOPUS:85052060471
SN - 2306-5729
VL - 2
JO - Data
JF - Data
IS - 2
M1 - 18
ER -