Local minimum generation error criterion for hybrid HMM speech synthesis

Xavi Gonzalvo*, Alexander Gutkin, Joan Claudi Socoró, Ignasi Iriondo, Paul Taylor

*Autor corresponent d’aquest treball

Producció científica: Article en revista indexadaArticle de conferènciaAvaluat per experts

9 Cites (Scopus)

Resum

This paper presents an HMM-driven hybrid speech synthesis approach in which unit selection concatenative synthesis is used to improve the quality of the statistical system using a Local Minimum Generation Error (LMGE) during the synthesis stage. The idea behind this approach is to combine the robustness due to HMMs with the naturalness of concatenated units. Unlike the conventional hybrid approaches to speech synthesis that use concatenative synthesis as a backbone, the proposed system employs stable regions of natural units to improve the statistically generated parameters. We show that this approach improves the generation of vocal tract parameters, smoothes the bad joints and increases the overall quality.

Idioma originalAnglès
Pàgines (de-a)416-419
Nombre de pàgines4
RevistaProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Estat de la publicacióPublicada - 2009
Esdeveniment10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom
Durada: 6 de set. 200910 de set. 2009

Fingerprint

Navegar pels temes de recerca de 'Local minimum generation error criterion for hybrid HMM speech synthesis'. Junts formen un fingerprint únic.

Com citar-ho