Extracting user preferences by GTM for aiGA weight tuning in unit selection text-to-speech synthesis

Lluís Formiga, Francesc Alías

Research output: Book chapterConference contributionpeer-review

7 Citations (Scopus)

Abstract

Unit-selection based Text-to-Speech synthesis systems aim to obtain high quality synthetic speech by optimally selecting previously recorded units. To that effect these units are selected by a dynamic programming algorithm guided through a weighted cost function. Thus, in this context, weights should be tuned perceptually so as to be in agreement with perception from listening users. In previous works we have proposed to subjectively tune these weights through an interactive evolutionary process, also known as Active Interactive Genetic Algorithm (aiGA). The problem comes out when different users, although being consistent, evolve to different weight configurations. In this proof-of-principle work, Generative Topographic Mapping (GTM) is introduced as a method to extract knowledge from user specific preferences. The experiments show that GTM is able to capture user preferences, thus, avoiding selecting the best evolved weight configuration by means of a second preference test.

Original languageEnglish
Title of host publicationComputational and Ambient Intelligence - 9th International Work-Conference on Artificial Neural Networks, IWANN 2007, Proceedings
PublisherSpringer Verlag
Pages654-661
Number of pages8
ISBN (Print)9783540730064
DOIs
Publication statusPublished - 2007
Event9th International Work-Conference on Artificial Neural Networks, IWANN 2007 - San Sebastian, Spain
Duration: 20 Jun 200722 Jun 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4507 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Work-Conference on Artificial Neural Networks, IWANN 2007
Country/TerritorySpain
CitySan Sebastian
Period20/06/0722/06/07

Fingerprint

Dive into the research topics of 'Extracting user preferences by GTM for aiGA weight tuning in unit selection text-to-speech synthesis'. Together they form a unique fingerprint.

Cite this