Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis

Francesc Alías, Xavier Llorà

Research output: Contribution to conference; Contribution; Peer-reviewed

12 Citations (Scopus)

Abstract

Unit selection text-to-speech (TTS) conversion remains an active research topic in the speech synthesis community. This paper focuses on tuning the weights involved in the target and concatenation cost metrics. We propose a method for adjusting these weights automatically and simultaneously by means of diphone and triphone pairs. The method builds on techniques from the evolutionary computation community, taking advantage of their robustness in noisy domains. The experiments and their analysis show that it performs well on this problem, overcoming some constraints assumed by previous work and opening a new framework for further investigation.

Original language: English
Pages: 1333-1336
Number of pages: 4
Publication status: Published - 2003
Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 2003 to 4 Sep 2003

Conference

Conference: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/Territory: Switzerland
City: Geneva
Period: 1/09/03 to 4/09/03
