Resumen
In this paper we present a speech synthesis method for diphonebased text-to-speech systems. Its main goal is to achieve prosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.
| Idioma original | Inglés |
|---|---|
| Páginas | 2953-2956 |
| Número de páginas | 4 |
| Estado | Publicada - 2003 |
| Evento | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Suiza Duración: 1 sept 2003 → 4 sept 2003 |
Conferencia
| Conferencia | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 |
|---|---|
| País/Territorio | Suiza |
| Ciudad | Geneva |
| Período | 1/09/03 → 4/09/03 |
Huella
Profundice en los temas de investigación de 'A hybrid method oriented to concatenative text-to-speech synthesis'. En conjunto forman una huella única.Cómo citar
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver