High quality emotional HMM-based synthesis in Spanish

Xavi Gonzalvo, Paul Taylor, Carlos Monzo, Ignasi Iriondo, Joan Claudi Socoró

Research output: Book chapterConference contributionpeer-review

3 Citations (Scopus)

Abstract

This paper describes a high-quality Spanish HMM-based speech synthesis of emotional speaking styles. The quality of the HMM-based speech synthesis is enhanced by using the most recent features presented for the Blizzard system (i.e. STRAIGHT spectrum extraction and mixed excitation). Two techniques are evaluated. First, a method simultaneously model all emotions within a single acoustic model. Second, an adaptation techniques to convert a neutral emotional style to a target emotion. We consider 3 kinds of emotions expressions: neutral, happy and sad. A subjective evaluation will show the quality of the system and the intensity of the produced emotion while an objective evaluation based on voice quality parameters evaluates the effectiveness of the approaches.

Original languageEnglish
Title of host publicationAdvances in Nonlinear Speech Processing - International Conference on Nonlinear Speech Processing, NOLISP 2009, Revised Selected Papers
Pages26-34
Number of pages9
DOIs
Publication statusPublished - 2010
EventInternational Conference on Nonlinear Speech Processing, NOLISP 2009 - Vic, Spain
Duration: 25 Jun 200927 Jun 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5933 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Nonlinear Speech Processing, NOLISP 2009
Country/TerritorySpain
CityVic
Period25/06/0927/06/09

Keywords

  • Adaptation
  • Emotion
  • HMM-based speech synthesis

Fingerprint

Dive into the research topics of 'High quality emotional HMM-based synthesis in Spanish'. Together they form a unique fingerprint.

Cite this