Filters








1,526 Hits in 4.9 sec

Perception of size modulated vowel sequence: Can we normalize the size of continuously changing vocal tract?

Minoru Tsuzaki, Chihiro Takeshima, Toshio Irino
2009 Acoustical Science and Technology  
Several studies have demonstrated that the normalization of vocal tract size can be achieved in a bottom-up manner.  ...  Changes in vocal tract size vary the formant frequencies, even when the shape of vocal tracts is the same and the spoken vowels are categorized to be the same.  ...  According to a study that investigated the relation between the height and the vocal tract length [8] , for which the correlation was high, the vocal tract length of the current speaker was estimated  ... 
doi:10.1250/ast.30.83 fatcat:op7uk6gkdnc3vpiaajwuzaliga

Speech MRI: Morphology and function

Andrew D. Scott, Marzena Wylezinska, Malcolm J. Birch, Marc E. Miquel
2014 Physica medica (Testo stampato)  
The earliest real-time gradient echo studies in the vocal tract [149] used FLASH sequences, had poor temporal resolution (3 fps), low spatial resolution (3 Â 6 Â 8 mm 3 ) and were used to study vocal  ...  Both FLASH [133, 141, 142] , and bSSFP [33, 59, 143] acquisitions have been used for real-time imaging of the vocal tract in the mid-sagittal plane.  ... 
doi:10.1016/j.ejmp.2014.05.001 pmid:24880679 fatcat:b6ht4f3g65dnreipemxjrt3qsu

Effect of combined source (F0) and filter (formant) variation on red deer hind responses to male roars

Benjamin D. Charlton, David Reby, Karen McComb
2008 Journal of the Acoustical Society of America  
vocal signals.  ...  The results revealed that red deer hinds showed greater overall attention and had lower response latencies to playbacks of roars where lower formants simulated larger males.  ...  tract length ͑VTL͒, and overall body size ͑Fitch, 2000͒.  ... 
doi:10.1121/1.2896758 pmid:18529210 fatcat:26gx7znfwrggdbnwf5qu7z7rhm

A Biomimetic Vocalisation System for MiRo [article]

Roger K. Moore, Ben Mitchinson
2017 arXiv   pre-print
This paper describes how MiRo's vocalisation system was designed, not using pre-recorded animal sounds, but based on the implementation of a real-time parametric general-purpose mammalian vocal synthesiser  ...  According to [11] , vocal tract length is correlated with body size: L = 3.15 + (11.53 × log M ). (7) This means that large animals have long vocal tracts and thus low formant frequencies (and vice versa  ...  tract length (using Equation 7 ).  ... 
arXiv:1705.05472v1 fatcat:2utipdjcljdunhxof6lpiqx6ga

A Biomimetic Vocalisation System for MiRo [chapter]

Roger K. Moore, Ben Mitchinson
2017 Lecture Notes in Computer Science  
This paper describes how MiRo's vocalisation system was designed, not using pre-recorded animal sounds, but based on the implementation of a real-time parametric general-purpose mammalian vocal synthesiser  ...  According to [11] , vocal tract length is correlated with body size: L = 3.15 + (11.53 × log M ). (7) This means that large animals have long vocal tracts and thus low formant frequencies (and vice versa  ...  tract length (using Equation 7 ).  ... 
doi:10.1007/978-3-319-63537-8_30 fatcat:cqyvmvd6izdkllasmq3tquxkcm

State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function

Sajan Goud Lingala, Asterios Toutios, Johannes Töger, Yongwan Lim, Yinghua Zhu, Yoon-Chul Kim, Colin Vaz, Shrikanth S. Narayanan, Krishna S. Nayak
2016 Interspeech 2016  
vocal tract Real-time 2D imaging was performed via a custom real-time interactive imaging platform (RT-Hawk, Heart Vista Inc, Los Altos, CA) [7] .  ...  Index Terms: MRI system for speech production, constrained reconstruction, rapid real-time MRI, structural and functional characterization of vocal tract Rapid 2D real-time MRI to evaluate dynamics of  ...  Table 1 : 1 Comprehensive vocal-tract imaging protocol Purpose Index Task Length Real-time 2D MRI (scripted speech) R1-R3 Consonants in symmetric VCV (3 scans) 30 sec (x3) R4 Vowels in  ... 
doi:10.21437/interspeech.2016-559 dblp:conf/interspeech/LingalaTTLZKVNN16 fatcat:ewzujoabprellnnvagr7mb3ddy

Assessment of infant cry: Acoustic cry analysis and parental perception

Linda L. LaGasse, A. Rebecca Neal, Barry M. Lester
2005 Mental Retardation and Developmental Disabilities Research Reviews  
The cry signal results from coordination among several brain regions that control respiration and vocal cord vibration from which the cry sounds are produced.  ...  Recall that formants are the harmonic content of a sound produced by mouth shape and vocal cord length. The infant vocal tract is obviously different in size and shape from the adult vocal tract.  ...  The infant vocal tract is smaller than the adult vocal track.  ... 
doi:10.1002/mrdd.20050 pmid:15856439 fatcat:hnmx3zjoi5fnjaqpqksqkhdyle

Electrolaryngeal Speech Identification using GMM

Nandana R., Mar Baselios College of Engineering and Technology
2020 International Journal of Engineering Research and  
Speech with highest score is identified and corresponding normal speech is produced using python platform.  ...  As air is breathed out through the vocal folds, vocal folds are vibrated and sound is produced, heard as a speech voice.  ...  Detection in real time E. Latency Test Total internal time taken by the system to recognize an EL word can be obtained by latency test.  ... 
doi:10.17577/ijertv9is060728 fatcat:fa6wliud6fh7hbm3ma2az7lvvq

Microphone-Based Electronic Wind Instrument By Feature Extraction From Breath Signals

Francesco Bigoni, Pelle Juul Christensen, Rasmus Eklund, Javier Molina Garcia, Cumhur Erkut
2018 Proceedings of the SMC Conferences  
The power spectra of normal blows in free air are smooth (with most energy at low frequencies) since a normal blow is essentially a noise signal.  ...  the position of the articulators in the vocal tract.  ... 
doi:10.5281/zenodo.1422534 fatcat:r3fs2gscffbtzmnnzjk3j2zsp4

Segmentation Strategies for Streaming Speech Translation

Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Andrej Ljolje, Rathinavelu Chengalvarayan
2013 North American Chapter of the Association for Computational Linguistics  
In order to improve ASR performance for our diverse data set, adaptation techniques such as constrained model adaptation and vocal tract length normalization are found to be useful.  ...  We also experiment with inserting text segmenters of various types between ASR and MT in a series of real-time translation experiments.  ...  The Vocal Tract Length Normalization (VTLN) was applied in two different ways. One was estimated on an utterance level, and the other at the talk level.  ... 
dblp:conf/naacl/SridharCBLC13 fatcat:2mihby6tnnht7l7utsdxsednki

Singing-driven interfaces for sound synthesizers

Jordi Janer, Xavier Serra
2008 Zenodo  
For the latter, we suggest to represent voice signal as vocal gestures, contributing with several voice analysis methods. To demonstrate the obtained results, we developed two real-time prototypes.  ...  This segmentation algorithm is designed for a real-time operation in low-latency conditions.  ...  Performance Analysis Real-time operation In terms of interaction, a digital musical instrument should work in real-time and with a latency as low as possible.  ... 
doi:10.5281/zenodo.3685558 fatcat:lnxevw4cmzht5m6rz2tgtk6a4y

Cortical sensitivity to periodicity of speech sounds

Santeri Yrttiaho, Hannu Tiitinen, Patrick J. C. May, Sakari Leino, Paavo Alku
2008 Journal of the Acoustical Society of America  
The behavior of the N1m latency and the emergence of a response cascade at very low F0 values may reflect the lower limit of pitch perception.  ...  Vowel F0 ranged from normal male speech ͑113 Hz͒ to exceptionally low values ͑9 Hz͒. Both the auditory N1m and sustained fields were larger in amplitude for periodic than for aperiodic vowels.  ...  The experiment was run with Presentation® software ͑Neurobehavioral Systems, Inc.͒ suitable for real-time delivery of auditory stimuli.  ... 
doi:10.1121/1.2888489 pmid:18397025 fatcat:luqhreg7kjeuvekzl335ll2pqm

A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction

Kou TANAKA, Tomoki TODA, Satoshi NAKAMURA
2017 IEICE transactions on information and systems  
By using the prototype system, we find latency issues caused by a real-time processing.  ...  An electrolarynx is an external device to generate excitation signals, instead of vibration of the vocal folds.  ...  Their ability to generate sound source excitation signals is severely impaired because they no longer have their vocal folds, although their vocal tracts remain.  ... 
doi:10.1587/transinf.2016edp7485 fatcat:wjf2s7vtwvclnocrj5yjeenm2y

Neural substrates of vocalizations in gulls and pigeons

JuanD. Delius
1971 Experimental Brain Research  
The wide gammut of calls and accompanying behaviour that was elicited is described and related to the normal behaviour, typical of the species concerned.  ...  It is suggested that the inferior colliculus incorporates the origin of a final common pathway to medular motor centres for all vocalization generating structures.  ...  Even while proposing this scheme, I believe that it does not adequately reflect the real complexity of the vocalization substrate.  ... 
doi:10.1007/bf00234416 pmid:5543202 fatcat:yawuor6y7fdubfwxb6tyrcmnq4

Disentangling the effects of phonation and articulation: hemispheric asymmetries in the auditory N1m response of the human brain

Hannu Tiitinen, Anna Mari Mäkelä, Ville Mäkinen, Patrick J C May, Paavo Alku
2005 BMC Neuroscience  
from the vocal tract leads to asymmetric behaviour of the left and right hemisphere.  ...  The variations in the latency and strength of the auditory N1m response can be traced back to the spectral structure of the stimuli.  ...  Changing the shape and the length of the vocal tract results in different formant frequency settings and, consequently, in variations of the perceived phoneme category.  ... 
doi:10.1186/1471-2202-6-62 pmid:16225699 pmcid:PMC1280927 fatcat:wlxtwzg3brckljbik66ii4ib7i
« Previous Showing results 1 — 15 out of 1,526 results