TY - JOUR
T1 - A semi-polar grid strategy for the three-dimensional finite element simulation of vowel-vowel sequences
AU - Arnela, Marc
AU - Dabbaghchian, Saeed
AU - Guasch, Oriol
AU - Engwall, Olov
N1 - Funding Information:
This research has partially been supported by EU-FET grant EUNISON 308874. The first and third authors also acknowledge the Agencia Estatal de Investigación (AEI) and FEDER, EU, through project GENIOVOX TEC2016-81107-P, the grant 2014-SGR-0590 from the Secretaria d’Universitats i Recerca del Departament d’Economia i Coneixement (Generalitat de Catalunya), and the support of grants 2016-URL-IR-010 and 2016-URL-IR-013 from the Generalitat de Catalunya and the Universitat Ramon Llull.
Publisher Copyright:
Copyright © 2017 ISCA.
PY - 2017
Y1 - 2017
N2 - Three-dimensional computational acoustic models need very detailed 3D vocal tract geometries to generate high quality sounds. Static geometries can be obtained from Magnetic Resonance Imaging (MRI), but it is not currently possible to capture dynamic MRI-based geometries with sufficient spatial and time resolution. One possible solution consists in interpolating between static geometries, but this is a complex task. We instead propose herein to use a semi-polar grid to extract 2D cross-sections from the static 3D geometries, and then interpolate them to obtain the vocal tract dynamics. Other approaches such as the adaptive grid have also been explored. In this method, cross-sections are defined perpendicular to the vocal tract midline, as typically done in 1D to obtain the vocal tract area functions. However, intersections between adjacent cross-sections may occur during the interpolation process, especially when the vocal tract midline quickly changes its orientation. In contrast, the semi-polar grid prevents these intersections because the plane orientations are fixed over time. Finite element simulations of static vowels are first conducted, showing that 3D acoustic wave propagation is not significantly altered when the semi-polar grid is used instead of the adaptive grid. The vowel-vowel sequence [Ai] is finally simulated to demonstrate the method.
AB - Three-dimensional computational acoustic models need very detailed 3D vocal tract geometries to generate high quality sounds. Static geometries can be obtained from Magnetic Resonance Imaging (MRI), but it is not currently possible to capture dynamic MRI-based geometries with sufficient spatial and time resolution. One possible solution consists in interpolating between static geometries, but this is a complex task. We instead propose herein to use a semi-polar grid to extract 2D cross-sections from the static 3D geometries, and then interpolate them to obtain the vocal tract dynamics. Other approaches such as the adaptive grid have also been explored. In this method, cross-sections are defined perpendicular to the vocal tract midline, as typically done in 1D to obtain the vocal tract area functions. However, intersections between adjacent cross-sections may occur during the interpolation process, especially when the vocal tract midline quickly changes its orientation. In contrast, the semi-polar grid prevents these intersections because the plane orientations are fixed over time. Finite element simulations of static vowels are first conducted, showing that 3D acoustic wave propagation is not significantly altered when the semi-polar grid is used instead of the adaptive grid. The vowel-vowel sequence [Ai] is finally simulated to demonstrate the method.
KW - Dynamic vocal tract
KW - Finite Element Method
KW - Semi-polar grid
KW - Speech synthesis
KW - Vocal tract acoustics
KW - Voice production
KW - Vowel-vowel sequences
UR - http://www.scopus.com/inward/record.url?scp=85039147985&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2017-448
DO - 10.21437/Interspeech.2017-448
M3 - Conference article
AN - SCOPUS:85039147985
SN - 2308-457X
VL - 2017-August
SP - 3477
EP - 3481
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017
Y2 - 20 August 2017 through 24 August 2017
ER -