Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis

Francesc Alías, Xavier Llorà

Research output: Conference paperContributionpeer-review

12 Citations (Scopus)

Abstract

Unit selection text-to-speech (TTS) conversion is an ongoing research for the speech synthesis community. This paper is focused on tuning the weights involved in the target concatenation cost metrics. We propose a method for automatically adjusting these weights simultaneously by means of diphone and triphone pairs. This method is based on techniques provided by the evolutionary computation community, taking advantage of their robustness in noisy domains. The experiments and their analyses demonstrate its good performance in this problem, thus, overcoming some constraints assumed by previous works leading to a new interesting framework for further investigations.

Original languageEnglish
Pages1333-1336
Number of pages4
Publication statusPublished - 2003
Event8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sept 20034 Sept 2003

Conference

Conference8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/TerritorySwitzerland
CityGeneva
Period1/09/034/09/03

Fingerprint

Dive into the research topics of 'Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis'. Together they form a unique fingerprint.

Cite this