Multiple F0 estimation in vocal ensembles using convolutional neural networks

Helena Cuesta, Brian McFee, Emilia Gómez
2020 Zenodo  
Our models outperform a state-of-the-art method intended for the same music genre when evaluated with an increased F0 resolution, as well as a general-purpose method for multi-F0 estimation.  ...  The pitch salience function is subsequently thresholded to obtain a multiple F0 estimation output.  ...  This work is partially supported by the European Commission under the TROMPA project (H2020 770376) and MARL-NYU (as part of a two-months research stay).  ... 
doi:10.5281/zenodo.4245434 fatcat:b2wpxk4e2vdktks43cpmfh5pvm

Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks [article]

Helena Cuesta, Brian McFee, Emilia Gómez
2020 arXiv   pre-print
Our models outperform a state-of-the-art method intended for the same music genre when evaluated with an increased F0 resolution, as well as a general-purpose method for multi-F0 estimation.  ...  The pitch salience function is subsequently thresholded to obtain a multiple F0 estimation output.  ...  This work is partially supported by the European Commission under the TROMPA project (H2020 770376) and MARL-NYU (as part of a two-months research stay).  ... 
arXiv:2009.04172v1 fatcat:mxsjqmmajjfalpjeaf6rlsvy3a

Data-driven Pitch Content Description of Choral Singing Recordings

Helena Cuesta, Emilia Gómez
2022 Zenodo  
Finally, we propose two methods to characterize vocal unison performances in terms of pitch dispersion.  ...  The second contribution is a set of deep learning models for multiple F0 estimation, streaming, and voice assignment of vocal quartets, mainly based on convolutional neural networks designed leveraging  ...  We wish for a bright future for research on choir music in the MIR field.  ... 
doi:10.5281/zenodo.6389643 fatcat:zibszrdivjhcnap2gzll3sbxga

Score-Informed Analysis Of Intonation And Pitch Modulation In Jazz Solos

Jakob Abeßer, Estefanía Cano, Klaus Frieler, Martin Pfleiderer, Wolf-Georg Zaddach
2015 Zenodo  
There are many studies on vibrato detection in audio recordings [14], particularly for singing voice [8, 9, 12].  ...  Based on the extracted f0 contours, we compute several contour features (see Section 4.5) to describe their temporal shape.  ... 
doi:10.5281/zenodo.1416835 fatcat:xq6mw7qbonemfkys6wyyng5you

Score-Informed Analysis of Tuning, Intonation, Pitch Modulation, and Dynamics in Jazz Solos

Jakob Abeßer, Klaus Frieler, Estefanía Cano, Martin Pfleiderer, Wolf-Georg Zaddach
2017 IEEE/ACM Transactions on Audio Speech and Language Processing  
Next, we compute the fundamental frequency contour for each tone in the solo and a set of features describing its temporal shape.  ...  After splitting the audio into a solo and a backing track, a reference tuning frequency is estimated from the backing track.  ...  ACKNOWLEDGEMENTS The Jazzomat research project is supported by a grant DFG-PF 669/7-1 ("Melodisch-rhythmische Gestaltung von Jazzimprovisationen.  ... 
doi:10.1109/taslp.2016.2627186 fatcat:6yw7asmxh5bf5lpr4zp7tcpnjy

The role of auditory feedback in speech and song

Tim A. Pruitt, Peter Q. Pfordresher
2015 Journal of Experimental Psychology: Human Perception and Performance  
When singing a melody or producing sentences, we take for granted the fact that the sounds we create (auditory feedback) match the intended consequences of our actions.  ...  On the other hand, AAF manipulations of pitch disrupt sequencing but not timing.  ...  This method of pitch estimation serves to reduce spurious F0 measurements that result from pitch tracking artifacts or idiosyncratic voice qualities (e.g., "voice breaking") and has been used in previous  ... 
doi:10.1037/a0038285 pmid:25384239 fatcat:qvzwga62rrbtxmt5hytqxhh76m

Pitch tracking of bird vocalizations and an automated process using YIN-bird

Colm O'Reilly, Naomi Harte, Hynek Burda
2017 Cogent Biology  
To use pitch as a feature, researchers need confidence in their pitch extraction system.  ...  This paper discusses pitch estimation performance on a variety of common bird vocalizations.  ...  Babacan et al. (2013) evaluated pitch tracking on singing voice, with PRAAT and RAPT providing the best determination of voicing boundaries.  ... 
doi:10.1080/23312025.2017.1322025 fatcat:d4u7laz6enaspmtyk2s4tzuvce

Analysis/synthesis comparison

2000 Organised Sound  
Although we have not directly compared the analysis results among the different systems, our work has made such a comparison possible.  ...  amplitude and frequency contours extracted from the surface and yield an accurate synthesis.  ...  As an example, stretching a low-pitched voice with this model usually leads to rather unnatural voice sounds.  ... 
doi:10.1017/s1355771800005070 fatcat:2l7qagmt7ngjxnnadzlnoemw5e

Lexical function of pitch in the first language shapes cross-linguistic perception of Thai tones

Vance Schaefer, Isabelle Darcy
2014 Laboratory Phonology  
., three level tones, two contour tones) by speakers of languages on a spectrum of lexically contrastive pitch usage: Mandarin (lexical tone), Japanese (lexical pitch accent), English (lexical stress),  ...  , intonation) or non-linguistic (e.g., singing).  ...  The female voice was used for the A and B items while the male voice was used for the X item, which can be either of the A or of the B category.  ... 
doi:10.1515/lp-2014-0016 fatcat:wepl4px5hbcclmlqppjrxkafwi

Identification and Control of PMSM Using Artificial Neural Network

Rajesh Kumar, R. A. Gupta, Ajay Kr. Bansal
2007 2007 IEEE International Symposium on Industrial Electronics  
As the first step, they extracted the pitch contours from automatically selected voice segments.  ...  Harmony Melodies are generated by a sequence of pitches-the "tune" of musical piece While sequences of pitches create melodies-the "tune" of a musical piece, and "voice" only part that can be regenerated  ... 
doi:10.1109/isie.2007.4374567 fatcat:hjhdj43wuzhpdlin5yi2sgmtb4

Multilinear Grammar: Ranks and Interpretations

Dafydd Gibbon, Sascha Griffiths
2017 Open Linguistics  
Multilinear Grammar provides a framework for integrating the many different syntagmatic structures of language into a coherent semiotically based Rank Interpretation Architecture, with default  ...  Default computational models for each rank are proposed, based on a Procedural Plausibility Condition: incremental processing in linear time with finite working memory.  ...  And in particular it is becoming clear that a reappraisal of the descriptive capacity of finite state systems and plausible extensions is needed.  ... 
doi:10.1515/opli-2017-0014 fatcat:wp7mjia5ezhp5bufvcrj3hjpni

Adaptive additive modeling with continuous parameter trajectories

A. Röbel
2006 IEEE Transactions on Audio, Speech, and Language Processing  
The adaptive analysis system is investigated by means of simple tracking experiments to demonstrate the effect of the smoothness constraints and compare the results with a standard STFT base frequency  ...  The potential of the adaptive strategy for the modeling of sinusoidal transients is discussed and it is shown that it achieves similar transient quality as a previously proposed method, however, with considerably  ...  The example concerned with tracking of frequency evolution is a singing voice signal that contains considerable pitch changes.  ... 
doi:10.1109/tsa.2005.858529 fatcat:t6ybpq7u6vfv5cptirkjmx77mq

Perception of emotion in musical performance in adolescents with autism spectrum disorders

Anjali Bhatara, Eve-Marie Quintin, Bianca Levy, Ursula Bellugi, Eric Fombonne, Daniel J. Levitin
2010 Autism Research  
Musical performance also employs a form of prosody to communicate emotion, and the goal of this study was to examine the ability of adolescents with ASD to understand musical emotion.  ...  "cantabile" meaning "in a singing style," Kennedy, 1999].  ...  ; Heaton, Hermelin, & Pring, 1998], preserved or superior sensitivity for detecting pitch direction [Heaton, 2005] and contour change [Mottron, Peretz, & Ménard, 2000], and superior chord disembedding  ... 
doi:10.1002/aur.147 pmid:20717952 pmcid:PMC2963682 fatcat:eoraitoupzhwbekgjgpad2bhmy

Sound, structure and meaning: The bases of prominence ratings in English, French and Spanish

Jennifer Cole, José I. Hualde, Caroline L. Smith, Christopher Eager, Timothy Mahrt, Ricardo Napoleão de Souza
2019 Journal of Phonetics  
In addition, words with a ToBI pitch accent type that is typically associated with contrastive focus are more likely to be rated as prominent in Spanish and English, but no such effect is found for French  ...  Prominence ratings from untrained listeners correspond with ToBI pitch accent labels for each language.  ...  /kánto/ 'I sing' vs /kantó/ 'she or he sang').  ... 
doi:10.1016/j.wocn.2019.05.002 fatcat:oeoub5woozccdcwuonqsn2n2qe

Reassignment of consonant allophones in rapid dialect acquisition

James S. German, Katy Carlson, Janet B. Pierrehumbert
2013 Journal of Phonetics  
This experiment therefore explored (a) whether speakers could learn to reassign a sound they already produce (flap) to a different phoneme, and (b) whether they could learn to reliably produce aspirated  ...  In an experiment spanning a week, American English speakers imitated a Glaswegian (Scottish) English speaker.  ...  Acknowledgements We would especially like to thank our Glaswegian speaker, Alistair McGowan, for lending us his time and his voice. We would like to acknowledge the support of the James S.  ... 
doi:10.1016/j.wocn.2013.03.001 fatcat:vi7my5byazc2hi2oy442wswvve