Articulatory speech synthesis from the fluid dynamics of the vocal apparatus [electronic resource] / Stephen Levinson [and others].
- Published
- San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, [2012]
- Copyright Date
- ©2012
- Physical Description
- 1 electronic text (xii, 104 pages) : illustrations, digital file
- Additional Creators
- Levinson, Stephen E.
Access Online
- Abstract with links to full text: ezaccess.libraries.psu.edu
- Series
- Restrictions on Access
- Abstract freely available; full-text restricted to subscribers or individual document purchasers.
- Contents
- 1. Introduction -- 1.1 History of speech synthesis -- 1.2 Speech production -- 1.3 Contributions -- 1.4 Organization of the book --, 2. Literature review -- 2.1 Overview of speech synthesis techniques -- 2.1.1 Concatenative synthesis -- 2.1.2 Formant synthesis -- 2.1.3 Articulatory synthesis -- 2.2 Overview of speech production model -- 2.2.1 Source-filter speech production model -- 2.2.2 Fricative model -- 2.2.3 Unvoiced speech sound production model -- 2.3 Overview of articulatory speech model -- 2.3.1 Coker's model -- 2.3.2 Synthesis of speech phonemes -- 2.3.3 Mermelstein's model -- 2.3.4 Task-dynamic model -- 2.4 Overview of the motor control of the articulator -- 2.4.1 A dynamic model of articulation -- 2.4.2 Motor control based on minimum cost principles -- 2.5 Summary --, 3. Estimation of dynamic articulatory parameters -- 3.1 Cubic spline method -- 3.2 Review of the signal representation techniques -- 3.2.1 Introduction -- 3.2.2 L2 space -- 3.2.3 Convolution-based signal representations -- 3.2.4 Interpolation and quasi-interpolation -- 3.2.5 Convolution-based least squares -- 3.2.6 Strang-fix conditions -- 3.3 Pointwise error analysis -- 3.3.1 Interpolation error -- 3.3.2 Least squares error -- 3.4 L2 error analysis -- 3.4.1 L2 error of quasi-interpolation -- 3.4.2 L2 error of the LS approximation -- 3.4.3 Comparison -- 3.5 Experimental results -- 3.6 Discussion -- 3.7 Future work -- 3.8 Summary --, 4. Construction of articulatory model based on MRI data -- 4.1 Problem formulation -- 4.2 Vocal cords models -- 4.3 Multi-mass model -- 4.4 Simulation result and future work --, 5. Vocal fold excitation models -- 5.1 Parametric models -- 5.1.1 Rosenberg's model -- 5.1.2 Titze's model -- 5.2 Mechanical model -- 5.2.1 Two-mass model -- 5.2.2 M-mass model -- 5.3 Simulation results -- 5.4 Discussion -- 5.5 Summary --, 6. Experimental results of articulatory synthesis -- 6.1 Governing equations, fluid dynamics analysis -- 6.2 Synthesized waveform -- 6.3 Speech analysis results -- 6.3.1 LPC spectrum and the short-time power spectrum -- 6.3.2 Spectrogram -- 6.4 Analysis of the velocity, vorticity, and pressure fields -- 6.5 Summary --, and 7. Conclusion -- Bibliography -- Authors' biographies.
- Summary
- This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter model, which assumes the independence of the excitation and filter,we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves through a viscous, compressible fluid described by the Navier-Stokes equations. We propose a combined minimum energy and minimum jerk criterion to compute the dynamics of the vocal tract during articulation. Theoretical error bounds and experimental results show that this method obtains a close match to the phonetic target positions while avoiding abrupt changes in the articulatory trajectory. The vocal folds are set into aerodynamic oscillation by the flow of air from the lungs. The modulated air stream then excites the moving vocal tract. This method shows strong evidence for source-filter interaction. Based on our results, we propose that the articulatory speech production model has the potential to synthesize speech and provide a compact parameterization of the speech signal that can be useful in a wide variety of speech signal processing problems.
- Subject(s)
- Other Subject(s)
- ISBN
- 9781598291797 (electronic bk.)
9781598291780 (pbk.) - Note
- Part of: Synthesis digital library of engineering and computer science.
Series from website.
AVAILABLE ONLINE TO AUTHORIZED PSU USERS. - Bibliography Note
- Includes bibliographical references (pages 95-101).
- Other Forms
- Also available in print.
- Technical Details
- Mode of access: World Wide Web.
System requirements: Adobe Acrobat Reader. - Indexed By
- Compendex
INSPEC
Google scholar
Google book search
View MARC record | catkey: 12563911