Emphatic visual speech synthesis

Javier Melenchón, Elisa Martínez, Fernando De La Torre, José A. Montero

Producció científica: Article en revista indexadaArticleAvaluat per experts

13 Cites (Scopus)


The synthesis of talking heads has been a flourishing, , research area over the last few years. Since human beings have, , an uncanny ability to read people's faces, most related applications, , (e.g., advertising, video-teleconferencing) require absolutely, , realistic photometric and behavioral synthesis of faces. This paper, , proposes a person-specific facial synthesis framework that allows, , high realism and includes a novel way to control visual emphasis, , (e.g., level of exaggeration of visible articulatory movements of the, , vocal tract). There are three main contributions: a geodesic interpolation, , with visual unit selection, a parameterization of visual emphasis, , , and the design of minimum size corpora. Perceptual tests, , with human subjects reveal high realism properties, achieving similar, , perceptual scores as real samples. Furthermore, the visual emphasis, , level and two communication styles show a statistical interaction, , relationship.

Idioma originalAnglès
Pàgines (de-a)459-468
Nombre de pàgines10
RevistaIEEE Transactions on Audio, Speech and Language Processing
Estat de la publicacióPublicada - de març 2009


Navegar pels temes de recerca de 'Emphatic visual speech synthesis'. Junts formen un fingerprint únic.

Com citar-ho