Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis

Francesc Alías, Xavier Llorà

Research output: Contribution to conference, Contribution, peer-reviewed

12 Citations (Scopus)

Abstract

Unit selection text-to-speech (TTS) conversion is an ongoing research topic in the speech synthesis community. This paper focuses on tuning the weights involved in the target and concatenation cost metrics. We propose a method for automatically adjusting these weights simultaneously by means of diphone and triphone pairs. The method is based on techniques from the evolutionary computation community and takes advantage of their robustness in noisy domains. The experiments and their analyses demonstrate its good performance on this problem; it overcomes some constraints assumed by previous works and provides a new framework for further investigation.

Original language: English
Pages: 1333-1336
Number of pages: 4
Status: Published - 2003
Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 2003 to 4 Sep 2003

Conference

Conference: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/Territory: Switzerland
City: Geneva
Period: 1/09/03 to 4/09/03
