Vocal effort modification through harmonics plus noise model representation

Àngel Calzada, Joan Claudi Socoró

Research output: Book chapterConference contributionpeer-review

6 Citations (Scopus)

Abstract

Harmonics plus Noise Model (HNM) is a well known speech signal representation technique that allows to apply high quality modifications to the signal used in text-to-speech systems providing higher flexibility than its counterpart TD-PSOLA based synthesis systems. In this paper an adaptation of the adaptive pre-emphasis linear prediction technique for modifying the vocal effort, using HNM speech representation, is presented. The proposed transformation methodology is validated using a Copy Re-synthesis strategy on a speech corpora specifically designed with three levels of vocal effort (soft, modal and loud). The results of a perceptual test demonstrate the effectiveness of the proposed technique performing all different vocal effort conversions for the given corpus.

Original languageEnglish
Title of host publicationAdvances in Nonlinear Speech Processing - 5th International Conference on Nonlinear Speech Processing, NOLISP 2011, Proceedings
Pages96-103
Number of pages8
DOIs
Publication statusPublished - 2011
Event5th International Conference on Nonlinear Speech Processing, NOLISP 2011 - Las Palmas de Gran Canaria, Spain
Duration: 7 Nov 20119 Nov 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7015 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Conference on Nonlinear Speech Processing, NOLISP 2011
Country/TerritorySpain
CityLas Palmas de Gran Canaria
Period7/11/119/11/11

Keywords

  • expressive speech
  • harmonics plus noise model
  • speech conversion
  • speech synthesis
  • vocal effort
  • voice quality

Fingerprint

Dive into the research topics of 'Vocal effort modification through harmonics plus noise model representation'. Together they form a unique fingerprint.

Cite this