High quality emotional HMM-based synthesis in Spanish

Xavi Gonzalvo, Paul Taylor, Carlos Monzo, Ignasi Iriondo, Joan Claudi Socoró

Research output: Book chapter, Conference contribution, Peer-reviewed

3 Citations (Scopus)

Abstract

This paper describes a high-quality Spanish HMM-based speech synthesis system for emotional speaking styles. The quality of the HMM-based speech synthesis is enhanced by using the most recent features presented for the Blizzard system (i.e. STRAIGHT spectrum extraction and mixed excitation). Two techniques are evaluated: first, a method that simultaneously models all emotions within a single acoustic model; second, an adaptation technique that converts a neutral speaking style to a target emotion. We consider three kinds of emotional expression: neutral, happy and sad. A subjective evaluation assesses the quality of the system and the intensity of the produced emotion, while an objective evaluation based on voice quality parameters measures the effectiveness of the approaches.

Original language: English
Title of host publication: Advances in Nonlinear Speech Processing - International Conference on Nonlinear Speech Processing, NOLISP 2009, Revised Selected Papers
Pages: 26-34
Number of pages: 9
DOIs
Publication status: Published - 2010
Event: International Conference on Nonlinear Speech Processing, NOLISP 2009 - Vic, Spain
Duration: 25 June 2009 - 27 June 2009

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5933 LNAI
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: International Conference on Nonlinear Speech Processing, NOLISP 2009
Country/Territory: Spain
City: Vic
Period: 25/06/09 - 27/06/09
