
Impact of Vocal Tract Resonance on the Perception of Voice Quality Changes Caused by Varying Vocal Fold Stiffness

Rosario Signorello, Zhaoyan Zhang, Bruce Gerratt, Jody Kreiman
2016 Acta Acustica united with Acustica  
Experiments using animal and human larynx models are often conducted without a vocal tract.  ...  Each series included a set of stimuli created with a physical vocal tract, and a second set created without a physical vocal tract.  ...  R01 DC011299 and R01 DC001797 from the National Institute on Deafness and Other Communication Disorders, the National Institutes of Health. We thank Shaghayegh Rastifar for testing listeners.  ... 
doi:10.3813/aaa.918937 pmid:27134616 pmcid:PMC4845961 fatcat:tvb4cl37nnbqffvgznyszuge5i

A Simplified Model for the Vocal Tract of [s] with Inclined Incisors

Tsukasa Yoshinaga, Kohei Tada, Kazunori Nozaki, Akiyoshi Iida
2021 Conference of the International Speech Communication Association  
As a control model, a realistic vocal tract replica of [s] was constructed from medical images, and the angle of the maxillary incisor was changed from the original position up to 30°.  ...  To examine the effects of inclined incisors on the phonation of [s], a simplified vocal tract model is proposed, and the acoustic characteristics with different maxillary incisor angles are predicted by  ...  Acknowledgements This work was supported by MEXT as "Priority Issue on Fugaku supercomputer" (hp200123, hp200134), JSPS KAKENHI (JP19H03976, JP20K14648), and JSPS Grant-in-Aid for Scientific Research on  ... 
doi:10.21437/interspeech.2021-231 dblp:conf/interspeech/YoshinagaTNI21 fatcat:u74icozphjbcdft4g4he5x4biy

A parametric method of computing acoustic characteristics of simplified three-dimensional vocal-tract model with wall impedance

Kunitoshi Motoki
2013 Acoustical Science and Technology  
A method of computing the acoustic characteristics of a simplified three-dimensional vocal-tract model with wall impedance is presented.  ...  The resonance characteristics of the vocal-tract model are evaluated using the radiated acoustic power.  ...  Hiroki Matsuzaki for valuable discussion on the acoustic analysis of the vocal tract.  ... 
doi:10.1250/ast.34.113 fatcat:dmwsd4se7rba5h4acqu26mwmgm

Speaker Identification From Youtube Obtained Data

Nitesh Kumar Chaudhary, Shraddha Srivastav
2014 Signal & Image Processing An International Journal  
identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech signal.  ...  able to obtain 79 ~ 82% of identification rate using Vector quantization and 85 ~ 92.6% of identification rate using GMM modeling by Expectation maximization parameter estimation depending on variation  ...  Human's vocal tract is performing like a filter, and its frequency characteristics is dependent upon the resonance peak from the vocal tract and vocal tract configuration can be obtained from the spectral  ... 
doi:10.5121/sipij.2014.5503 fatcat:2fa2ypvmrfadxgglmd7tlulily

A COMPUTER REPRESENTATION OF THE GEOMETRICAL CONFIGURATION OF THE VOCAL TRACT

M Mihkla
1979 Proceedings of the Academy of Sciences of the Estonian SSR. Physics. Mathematics  
Several works on the geometrical configuration of the vocal tract are restricted to its sagittal model, i. e.  ...  Thus the articulatory model of the vocal tract is reduced to the task of giving, in the case of each sound, as accurate and convenient a description of the position or movement of the tongue (and lips)  ... 
doi:10.3176/phys.math.1979.3.11 fatcat:qz2mrdz5nrbavfiwu5joiggvfi

Hybrid parametric-physiological glottal modelling with application to voice quality assessment

Carlo Drioli, Federico Avanzini
2002 Medical Engineering and Physics  
A glottal model based on physical constraints is proposed. The model describes the vocal fold as a simple oscillator, i.e. a damped mass-spring system.  ...  The model is used to analyse voiced sounds from normal and from pathological voices, and the application of the proposed analysis procedure to voice quality assessment is discussed.   ...  The use of parametric and physiological models of vocal emission has been proposed for a wide range of applications, namely speech synthesis, speech coding and compression, and voice quality analysis and  ... 
doi:10.1016/s1350-4533(02)00057-7 pmid:12237039 fatcat:odafws5jffdrbmvid3k5dixp2i

Complex cepstrum factorization for statistical parametric synthesis

Ranniery Maia, Yannis Stylianou
2014 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
This paper presents a study on complex cepstrum-based speech factorization for acoustic modeling in statistical parametric synthesizers.  ...  The factorization is conducted assuming that both vocal tract resonance and glottal flow effect are fully represented by the complex cepstrum.  ...  as vocal tract and glottal flow parameters.  ... 
doi:10.1109/icassp.2014.6854320 dblp:conf/icassp/MaiaS14 fatcat:suupk6sd6rfblobjpsrky3mtpa

A New Glottal Neural Vocoder for Speech Synthesis

Yang Cui, Xi Wang, Lei He, Frank K. Soong
2018 Interspeech 2018  
Direct modeling of waveform generation for speech synthesis, e.g. WaveNet, has made significant progress on improving the naturalness and clarity of TTS.  ...  In the analysis, speech signals are decomposed into corresponding glottal source signals and vocal tract filters by the glottal inverse filtering.  ...  However, such autoregressive models are a lot more computational and more memory expensive than traditional parametric vocoders.  ... 
doi:10.21437/interspeech.2018-1757 dblp:conf/interspeech/CuiWHS18 fatcat:iotczrrrrnhnzmcmu4wl6nvowm

Computer-Implemented Articulatory Models for Speech Production: A Review

Bernd J. Kröger
2022 Frontiers in robotics and AI 9  
us to reach the goal of high-quality articulatory-acoustic speech synthesis based on more detailed knowledge on vocal tract acoustics and speech articulation.  ...  Thus, on the one hand computer-modeling will help us to unfold underlying biological as well as acoustic-articulatory concepts of speech production and on the other hand further modeling efforts will help  ...  i.e., the (one-dimensional) distance between articulator and vocal tract wall as function of distance from the glottis along the midline of the vocal tract from glottis to mouth (e.g., Stone et al., 2018  ... 
doi:10.18154/rwth-2022-04686 fatcat:53ufwkkzmzcelpja7wedbcfnhq

Computer-Implemented Articulatory Models for Speech Production: A Review

Bernd J. Kröger
2022 Frontiers in Robotics and AI  
us to reach the goal of high-quality articulatory-acoustic speech synthesis based on more detailed knowledge on vocal tract acoustics and speech articulation.  ...  Thus, on the one hand computer-modeling will help us to unfold underlying biological as well as acoustic-articulatory concepts of speech production and on the other hand further modeling efforts will help  ...  i.e., the (one-dimensional) distance between articulator and vocal tract wall as function of distance from the glottis along the midline of the vocal tract from glottis to mouth (e.g., Stone et al., 2018  ... 
doi:10.3389/frobt.2022.796739 pmid:35494539 pmcid:PMC9040071 fatcat:qtr7qnir6zdlplyywsdd4kxad4

Cantor Digitalis: chironomic parametric synthesis of singing

Lionel Feugère, Christophe d'Alessandro, Boris Doval, Olivier Perrotin
2017 EURASIP Journal on Audio, Speech, and Music Processing  
The sound generation system is based on a parametric synthesizer that features a spectral voice source model, a vocal tract model consisting of parallel filters for vocalic formants and cascaded with anti-resonance  ...  Because Cantor Digitalis is a parametric system, every aspect of voice quality can be controlled (e.g., vocal tract size, aperiodicities in the voice source, vowels, and so forth).  ...  LF and OP developed the software and documentation, with contributions of CdA and BD. LF and  ... 
doi:10.1186/s13636-016-0098-5 fatcat:ytmc4ey5xjdeljymb5kjd3mp2m

A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems

Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo Kang
2018 Interspeech 2018  
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems.  ...  To alleviate this problem, we propose a unified training approach that directly generates speech parameters by merging all the required models, such as acoustic, glottal, and noise models, into a single  ...  MbG-structured glottal vocoding system The output AFs consist of vocal tract line spectral frequencies (LSF-VT), a voicing flag (VUV), a logarithm energy (Erg), vocal source LSFs (LSF-VS), and a logarithm  ... 
doi:10.21437/interspeech.2018-1590 dblp:conf/interspeech/HwangSKK18 fatcat:vlf7ywjxcvaipndr27ltlmxfyi

Analyses of vocal tract cross-distance to area mapping: An investigation of a set of vowel images

Richard S. McGowan, Michel T-T. Jackson, Michael A. Berger
2012 Journal of the Acoustical Society of America  
These are not the optimal models on which to base the construction of a mapping between the two domains.  ...  One is a vowel height-sensitive model and the other is a nonparametric model called loess. These depend on global cross-distance information and generally perform better than the traditional models.  ...  a-b models that depend on vowel features and a non-parametric model called loess.  ... 
doi:10.1121/1.3665988 pmid:22280604 pmcid:PMC3272714 fatcat:3wsgoj2245b7ffmwt5cnfwo2xa

Speaker adaptive voice source modeling with applications to speech coding and processing

Carlo Drioli, Andrea Calanca
2014 Computer Speech and Language  
Acknowledgements We wish to thank the two anonymous reviewers for their extremely useful and constructive comments, which helped us to significantly improve this paper. Appendix A.  ...  Scheme of the low-dimensional voice source used as glottal waveform generator (note that the vocal tract model is not represented here).  ...  an all-pole model of the vocal tract, and u'_g(t) is the derivative of u_g(t), the glottal pulse waveform.  ... 
doi:10.1016/j.csl.2014.01.002 fatcat:pnsjkwty3ng53kraf44bozwcsy

Acoustic To Articulatory Speech Inversion Using Multi-Resolution Spectro-Temporal Representations Of Speech Signals [article]

Rahil Parikh, Nadee Seneviratne, Ganesh Sivaraman, Shihab Shamma, Carol Espy-Wilson
2022 arXiv   pre-print
These features produce a higher dimensional representation of the speech signals.  ...  Experiments achieved a correlation of 0.675 with ground-truth tract variables.  ...  acoustic and corresponding articulatory patterns is constructed from the training data • Analytical approaches involving articulatory models such as Maeda's [7] • Statistical modeling (parametric and  ... 
arXiv:2203.05780v2 fatcat:5kgx5lidnrdz5bzjofqqv7pejy