High quality emotional HMM-based synthesis in Spanish

Xavi Gonzalvo, Paul Taylor, Carlos Monzo, Ignasi Iriondo, Joan Claudi Socoró

Research output: Book chapter, Conference contribution, peer-reviewed

3 Citations (Scopus)

Abstract

This paper describes a high-quality Spanish HMM-based speech synthesis system for emotional speaking styles. The quality of the HMM-based speech synthesis is enhanced by using the most recent features presented for the Blizzard system (i.e. STRAIGHT spectrum extraction and mixed excitation). Two techniques are evaluated. The first is a method that simultaneously models all emotions within a single acoustic model. The second is an adaptation technique that converts a neutral emotional style to a target emotion. We consider three kinds of emotional expression: neutral, happy and sad. A subjective evaluation shows the quality of the system and the intensity of the produced emotion, while an objective evaluation based on voice quality parameters assesses the effectiveness of the approaches.

Original language: English
Host publication title: Advances in Nonlinear Speech Processing - International Conference on Nonlinear Speech Processing, NOLISP 2009, Revised Selected Papers
Pages: 26-34
Number of pages: 9
DOI
Status: Published - 2010
Event: International Conference on Nonlinear Speech Processing, NOLISP 2009 - Vic, Spain
Duration: 25 Jun 2009 to 27 Jun 2009

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5933 LNAI
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: International Conference on Nonlinear Speech Processing, NOLISP 2009
Country/Territory: Spain
City: Vic
Period: 25/06/09 to 27/06/09
