
Emphatic visual speech synthesis

  • Javier Melenchón*
  • Elisa Martínez
  • Fernando De La Torre
  • José A. Montero

*Corresponding author for this work

Research output: Article in indexed journal, peer-reviewed

13 Citations (Scopus)

Abstract

The synthesis of talking heads has been a flourishing research area over the last few years. Since human beings have an uncanny ability to read people's faces, most related applications (e.g., advertising, video-teleconferencing) require absolutely realistic photometric and behavioral synthesis of faces. This paper proposes a person-specific facial synthesis framework that allows high realism and includes a novel way to control visual emphasis (e.g., level of exaggeration of visible articulatory movements of the vocal tract). There are three main contributions: a geodesic interpolation with visual unit selection, a parameterization of visual emphasis, and the design of minimum size corpora. Perceptual tests with human subjects reveal high realism properties, achieving similar perceptual scores as real samples. Furthermore, the visual emphasis level and two communication styles show a statistical interaction relationship.

Original language: English
Pages (from-to): 459-468
Number of pages: 10
Journal: IEEE Transactions on Audio, Speech and Language Processing
Volume: 17
Issue: 3
Status: Published - Mar 2009
