Prediction of the acoustic comfort of a dwelling based on automatic sound event detection

Daniel Bonet-Solà, Ester Vidaña-Vila, Rosa Ma Alsina-Pagès

Research output: Article in indexed journal · Article · Peer-reviewed

1 Citation (Scopus)

Abstract

There is increasing concern about noise pollution around the world. As a first step toward tackling the problem of deteriorated urban soundscapes, this article develops a tool that automatically evaluates the soundscape quality of dwellings from the acoustic events detected in short videos recorded on-site. A sound event classifier based on a convolutional neural network detects the sounds present in those videos. Once the events are detected, the approach proceeds in two steps. First, the detected acoustic events are used as inputs to a binary assessment system that uses logistic regression to predict whether the user’s perception of the soundscape (and, therefore, the soundscape quality estimator) is “comfortable” or “uncomfortable”. Second, an Acoustic Comfort Index (ACI) on a scale of 1–5 is estimated with a linear regression model. The system achieves an accuracy above 80% in predicting the subjective opinion of citizens based only on the sound events automatically detected on their balconies. The ultimate goal is to predict an ACI for new locations using only a 30-second video as input. The tool could offer data-driven insights for mapping how annoying or pleasant the acoustic environment is for people, and could support public administrations in mitigating noise pollution and improving urban living conditions, contributing to improved well-being and community engagement.
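
The two-step assessment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the CNN detection stage is omitted, the event classes, training values, learning rates, and the clamping of the index to the 1–5 range are all assumptions made for illustration, and both models are fitted with plain gradient descent using only the Python standard library.

```python
import math

# Hypothetical event classes; the abstract does not list the actual taxonomy.
EVENT_CLASSES = ["traffic", "birds", "voices"]

# Made-up toy data (NOT results from the study): per-clip event counts,
# a binary comfort label, and a 1-5 comfort rating for each clip.
X = [[8, 0, 1], [6, 1, 2], [7, 0, 0], [1, 5, 1], [0, 6, 2], [2, 4, 0]]
y_binary = [0, 0, 0, 1, 1, 1]  # 1 = "comfortable", 0 = "uncomfortable"
y_rating = [1.5, 2.0, 1.0, 4.0, 5.0, 4.5]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def identity(z):
    return z


def linear_score(w, b, x):
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def fit(X, y, link, lr, epochs):
    """Batch gradient descent; the (prediction - target) * x gradient holds
    for both log loss with a sigmoid link and squared error with identity."""
    w = [0.0] * len(X[0])
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = link(linear_score(w, b, xi)) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict_comfort(counts, model):
    """Step 1: binary label from logistic regression."""
    w, b = model
    p = sigmoid(linear_score(w, b, counts))
    return "comfortable" if p >= 0.5 else "uncomfortable"


def predict_aci(counts, model):
    """Step 2: index from linear regression, clamped to 1-5 (an assumption)."""
    w, b = model
    return min(5.0, max(1.0, linear_score(w, b, counts)))


clf = fit(X, y_binary, sigmoid, lr=0.05, epochs=2000)
reg = fit(X, y_rating, identity, lr=0.01, epochs=5000)

new_clip = [0, 7, 1]  # hypothetical counts detected in a new 30-second clip
print(predict_comfort(new_clip, clf), round(predict_aci(new_clip, reg), 2))
```

In practice a library such as scikit-learn would normally replace the hand-written `fit` function, and the feature vector would come from the output of the sound event classifier rather than from hand-entered counts.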

Original language: English
Article number: 20220177
Journal: Noise Mapping
Volume: 10
Issue: 1
DOIs
Publication status: Published - 1 Jan 2023
Published externally
