The role of prosody and voice quality in text-dependent categories of storytelling across languages

Raúl Montaño, Francesc Alías

Producció científica: Article en revista indexadaArticle de conferènciaAvaluat per experts

Resum

In contrast to full-blown emotions, storytelling speech entails a particular speaking style that contains subtle expressive nuances of which little is known. In the present work, we study the role of prosody and voice quality while searching for crosslinguistic acoustic similarities in two categories of storytelling speech that are defined by their lexical components: The descriptive mode and sentences that specify a character intervention, together with a third neutral category (perceptually validated as reference). The study addresses four narrators using four different European languages (English, French, German and Spanish) expressing the same story. After conducting several statistical and discriminant analyses, we find that all narrators under analysis exploit some acoustic parameters in a similar way to differentiate among the analysed storytelling categories. Specifically, we observe that three prosodic features (mean fundamental frequency, mean intensity and number of silent pauses) and two voice quality parameters (mean Harmonic-to-Noise Ratio and Maxima Dispersion Quotient) explain a relatively similar proportion of the variance among storytelling categories in all languages. Moreover, the classification results obtained from the discriminant analysis are comparable for the three considered storytelling categories across languages.

Idioma originalAnglès
Pàgines (de-a)1186-1190
Nombre de pàgines5
RevistaProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volum2015-January
Estat de la publicacióPublicada - 2015
Esdeveniment16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Durada: 6 de set. 201510 de set. 2015

Fingerprint

Navegar pels temes de recerca de 'The role of prosody and voice quality in text-dependent categories of storytelling across languages'. Junts formen un fingerprint únic.

Com citar-ho