Efficient interactive weight tuning for tts synthesis: Reducing user fatigue by improving user consistency

Francesc Alías, Xavier Llorà, Lluís Formiga, Kumara Sastry, David E. Goldberg

Producción científica: Capítulo del libroContribución a congreso/conferenciarevisión exhaustiva

9 Citas (Scopus)

Resumen

The quality of corpus-based text-to-speech systems depends on the accuracy of the unit selection process, which in turn relies on the cost function definition. This function should map the user perceptual preference when selecting synthesis units, which is a very difficult task. This paper continues our previous work on fusing the human judgements with the cost function by means of interactive weight tuning. The application of active interactive genetics algorithms mitigates user fatigue by improving user consistency. As a result, the obtained weights generate more natural synthetic speech when compared to previous objective and subjective proposals.

Idioma originalInglés
Título de la publicación alojada2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings
PáginasI865-I868
EstadoPublicada - 2006
Evento2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 - Toulouse, Francia
Duración: 14 may 200619 may 2006

Serie de la publicación

NombreICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volumen1
ISSN (versión impresa)1520-6149

Conferencia

Conferencia2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006
País/TerritorioFrancia
CiudadToulouse
Período14/05/0619/05/06

Huella

Profundice en los temas de investigación de 'Efficient interactive weight tuning for tts synthesis: Reducing user fatigue by improving user consistency'. En conjunto forman una huella única.

Citar esto