Abstract
The quality of corpus-based text-to-speech systems depends on the accuracy of the unit selection process, which in turn relies on the weights of the cost function. This paper defines a new framework for tuning these weights. We propose a technique that takes the subjective perception of speech into account during selection by means of Interactive Genetic Algorithms. Moreover, we introduce a CART-based method for unit clustering. Both techniques are applied to weight tuning based on diphone pairs. The experiments analyze the feasibility of each proposal separately.
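To make the role of the weights concrete, the following is a minimal sketch of a weighted cost function and the selection it drives. It assumes a plain weighted sum of sub-costs; the sub-cost names (`pitch`, `duration`), the candidate ids, and all numeric values are hypothetical and are not taken from the paper, and neither the Interactive Genetic Algorithm nor the CART clustering is modelled here.

```python
from typing import Dict

# Hypothetical sub-cost names; the abstract does not list the actual features.
SubCosts = Dict[str, float]


def weighted_cost(sub_costs: SubCosts, weights: Dict[str, float]) -> float:
    """Weighted sum of the sub-costs of one candidate unit (assumed form)."""
    return sum(weights[name] * value for name, value in sub_costs.items())


def select_unit(candidates: Dict[str, SubCosts], weights: Dict[str, float]) -> str:
    """Return the id of the candidate with the lowest weighted cost."""
    return min(candidates, key=lambda uid: weighted_cost(candidates[uid], weights))


# Illustrative candidates: unit_a matches pitch well, unit_b matches duration well.
candidates = {
    "unit_a": {"pitch": 0.1, "duration": 0.9},
    "unit_b": {"pitch": 0.8, "duration": 0.2},
}

# Two illustrative weight settings that emphasise different sub-costs.
pitch_heavy = {"pitch": 0.9, "duration": 0.1}
duration_heavy = {"pitch": 0.1, "duration": 0.9}

chosen_pitch_heavy = select_unit(candidates, pitch_heavy)
chosen_duration_heavy = select_unit(candidates, duration_heavy)
```

With these made-up numbers the two weight settings select different units, which is why the choice of weights matters for the quality of the synthesized speech.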
| Original language | English |
|---|---|
| Pages | 1221-1224 |
| Number of pages | 4 |
| Status | Published - 2004 |
| Event | 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Republic of Korea. Duration: 4 Oct 2004 → 8 Oct 2004 |
Conference
| Conference | 8th International Conference on Spoken Language Processing, ICSLP 2004 |
|---|---|
| Country/Territory | Republic of Korea |
| City | Jeju, Jeju Island |
| Period | 4 Oct 2004 → 8 Oct 2004 |
Title: Perception-guided and phonetic clustering weight tuning based on diphone pairs for unit selection TTS