Expressive speech style transformation: Voice quality and prosody modification using a harmonic plus noise model

Carlos Monzo, Àngel Calzada, Ignasi Iriondo, Joan Claudi Socoró

Research output: Book chapter, Conference contribution, peer-review

4 Citations (Scopus)

Abstract

This paper proposes an approach to transform speech from a neutral style into other expressive styles using both prosody and voice quality (VoQ). The main aim is to validate the usefulness of VoQ in the enhancement of expressive synthetic speech. A Harmonic plus Noise Model (HNM) is used to modify speech following a set of rules extracted from an expressive speech corpus with five categories (neutral, happy, sensual, aggressive and sad). Finally, the modified speech utterances were used in a perceptual test. The results of this test indicate that listeners prefer prosody modification combined with VoQ transformation over prosody modification alone.

Original language: English
Title of host publication: 5th International Conference on Speech Prosody 2010
Publisher: International Speech Communication Association
ISBN (Electronic): 9780000000002
Publication status: Published - 2010
Event: 5th International Conference on Speech Prosody: Every Language, Every Style, SP 2010 - Chicago, United States
Duration: 10 May 2010 to 14 May 2010

Publication series

Name: Proceedings of the International Conference on Speech Prosody
ISSN (Print): 2333-2042

Conference

Conference: 5th International Conference on Speech Prosody: Every Language, Every Style, SP 2010
Country/Territory: United States
City: Chicago
Period: 10/05/10 to 14/05/10

Keywords

  • Expressive speech transformation
  • Harmonic plus Noise Model
  • Prosody
  • Voice quality
