Adding singing capabilities to unit selection TTS through HNM-based conversion

Producció científica: Capítol de llibreContribució a congrés/conferènciaAvaluat per experts

1 Citació (Scopus)

Resum

Adding singing capabilities to a corpus-based concatenative text-to-speech (TTS) system can be addressed by explicitly collecting singing samples from the previously recorded speaker. However, this approach is only feasible if the considered speaker is also a singing talent. As an alternative, we consider appending a Harmonic plus Noise Model (HNM) speech-to-singing conversion module to a Unit Selection TTS (US-TTS) system. Two possible text-to-speech-to-singing synthesis approaches are studied: applying the speech-to-singing conversion to the US-TTS synthetic output, or implementing a hybrid US+HNM synthesis framework. The perceptual tests show that the speech-to-singing conversion yields similar singing resemblance than the natural version, but with lower naturalness. Moreover, no statistically significant differences are found between both strategies in terms of naturalness nor singing resemblance. Finally, the hybrid approach allows reducing more than twice the overall computational cost.

Idioma originalAnglès
Títol de la publicacióAdvances in Speech and Language Technologies for Iberian Languages - 3rd International Conference, IberSPEECH 2016, Proceedings
EditorsCarmen Garcia Mateo, Alfonso Ortega, Alberto Abad, Nuno Mamede, Carlos D. Martínez Hinarejos, Antonio Teixeira, Fernando Batista, Fernando Perdigao
EditorSpringer Verlag
Pàgines33-43
Nombre de pàgines11
ISBN (imprès)9783319491684
DOIs
Estat de la publicacióPublicada - 2016
Esdeveniment3rd International Conference on Advances in Speech and Language Technologies for Iberian Languages, IberSPEECH 2016 - PRT, Turkey
Durada: 23 de nov. 201625 de nov. 2016

Sèrie de publicacions

NomLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volum10077 LNAI
ISSN (imprès)0302-9743
ISSN (electrònic)1611-3349

Conferència

Conferència3rd International Conference on Advances in Speech and Language Technologies for Iberian Languages, IberSPEECH 2016
País/TerritoriTurkey
CiutatPRT
Període23/11/1625/11/16

Fingerprint

Navegar pels temes de recerca de 'Adding singing capabilities to unit selection TTS through HNM-based conversion'. Junts formen un fingerprint únic.

Com citar-ho