A hybrid method oriented to concatenative text-to-speech synthesis

Ignasi Iriondo, Francesc Alías, Javier Sanchis, Javier Melenchón

Research output: Conference contribution, Article, Peer-reviewed

3 Citations (Scopus)

Abstract

In this paper we present a speech synthesis method for diphone-based text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time scale. Preliminary results show an improvement in synthetic speech quality when large pitch modifications are required.

Original language: English
Pages: 2953-2956
Number of pages: 4
Publication status: Published - 2003
Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 2003 to 4 Sep 2003

Conference

Conference: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/Territory: Switzerland
City: Geneva
Period: 1/09/03 to 4/09/03
