Filters








2,888 Hits in 3.2 sec

Audiovisual Singing Voice Separation [article]

Bochen Li, Yuxuan Wang, Zhiyao Duan
2021 arXiv   pre-print
To facilitate the network to learn audiovisual correlation of singing activities, we add extra vocal signals irrelevant to the mouth movement to the audio mixture during training.  ...  We create two audiovisual singing performance datasets for training and evaluation, respectively, one curated from audition recordings on the Internet, and the other recorded in house.  ...  This line of research achieves promising results in audiovisual music separation, but have not addressed singing voice separation. III. METHOD A.  ... 
arXiv:2107.00231v1 fatcat:hzlhescxzvcedkrpsl5a7zkqyy

Audiovisual Singing Voice Separation

Bochen Li, Yuxuan Wang, Zhiyao Duan
2021 Transactions of the International Society for Music Information Retrieval  
To facilitate the network to learn audiovisual correlation of singing activities, we add extra vocal signals irrelevant to the mouth movement to the audio mixture during training.  ...  We create two audiovisual singing performance datasets for training and evaluation, respectively, one curated from audition recordings on the Internet, and the other recorded in house.  ...  This line of research achieves promising results in audiovisual music separation for musical instrument performances, but not yet on singing voice separation.  ... 
doi:10.5334/tismir.108 fatcat:5k2cd26tufbi7mv7lx6kvlomee

Distributed neural signatures of natural audiovisual speech and music in the human auditory cortex

Juha Salmi, Olli-Pekka Koistinen, Enrico Glerean, Pasi Jylänki, Aki Vehtari, Iiro P. Jääskeläinen, Sasu Mäkelä, Lauri Nummenmaa, Katarina Nummi-Kuisma, Ilari Nummi, Mikko Sams
2017 NeuroImage  
Distributed neural signatures of natural audiovisual speech and music in the human auditory cortex, NeuroImage, http://dx.  ...  Abstract During a conversation or when listening to music, auditory and visual information are combined automatically into audiovisual objects.  ...  Four separate binary classifiers were trained to discriminate between Auditory and Audiovisual stimuli, both separately for each stimulus type (Piano, Singing, and Speech), as well as for all Auditory  ... 
doi:10.1016/j.neuroimage.2016.12.005 pmid:27932074 fatcat:h2uitc7ehvfurflwudzpfd6sxu

Perception of Multisensory Gender Coherence in 6- and 9-Month-Old Infants

Anne Hillairet de Boisferon, Eve Dupierrix, Paul C. Quinn, Hélène Lœvenbruck, David J. Lewkowicz, Kang Lee, Olivier Pascalis
2015 Infancy  
The two faces were separated by a 15-cm gap. For each infant, the gender of the voice was the same in both trials.  ...  sing it.  ... 
doi:10.1111/infa.12088 pmid:26561475 pmcid:PMC4637175 fatcat:bejartsmvffp3i3xi5kcto4bxy

Mamãe eu quero: Carmen Miranda's Maternal Abundance

Sean Griffin
2016 Rebeca: Revista Brasileira de Estudos de Cinema e Audiovisual  
The voice also traces the forms of unity and separation between bodies.  ...  Roberts, 15-16, discusses how gay male culture during the early 40s appropriated Miranda. revista brasileira de estudos de cinema e audiovisual she is singing about.  ... 
doi:10.22475/rebeca.v1n2.283 fatcat:6oc5sjdrz5fnnfzov2m2p2rvum

Infants' responsiveness to maternal speech and singing

Takayuki Nakata, Sandra E. Trehub
2004 Infant Behavior and Development  
Infants who were 6 months of age were presented with extended audiovisual episodes of their mother's infantdirected speech or singing.  ...  Cumulative visual fixation and initial fixation of the mother's image were longer for maternal singing than for maternal speech.  ...  Test stimuli generated by the procedure consisted of 4-min audiovisual segments of singing or speaking from each mother.  ... 
doi:10.1016/j.infbeh.2004.03.002 fatcat:2ecfqw6ibbcobfq6cfdx25yxgi

Page 304 of Colby Quarterly Vol. 36, Issue 4 [page]

2000 Colby Quarterly  
This separation of Kathy’s voice from her body, how- ever, does not disembody it in a manner similar to voice-over, which usually implies a position of power within—sometimes over—the cinematic appara-  ...  Rather, her voice and the song she sings remain firmly anchored within the diegesis: at each transition we see how the source of the song’s music 1s always emanating from a space clearly within the story  ... 

Read my lips: speech distortions in musical lyrics can be overcome (slightly) by facial information

Dominic W. Massaro, Alexandra Jesse
2009 Speech Communication  
Changes in vowel intelligibility due to singing are mostly a problem in high-pitch female singing voices, such as sopranos.  ...  However, the timing of the articulatory movements were exactly aligned to the singing voice.  ... 
doi:10.1016/j.specom.2008.05.013 fatcat:hx5khoacxnaw3nv6nts24ss4xm

Automatic Singing Voice To Music Video Generation Via Mashup Of Singing Video Clips

Tatsunori Hirai, Yukara Ikemiya, Kazuyoshi Yoshii, Tomoyasu Nakano, Masataka Goto, Shigeo Morishima
2015 Proceedings of the SMC Conferences  
Singing voice separation To extract singing voice features, a singing voice separation method is required. We apply the singing voice separation method proposed by Ikemiya et al.  ...  p p p p p p p Singing voice separation VAD Vocal secƟon Inst.  ... 
doi:10.5281/zenodo.851032 fatcat:tsp2j3ca6fhxfbviznfqroxvx4

VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer [article]

Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro
2022 arXiv   pre-print
This paper presents an audio-visual approach for voice separation which produces state-of-the-art results at a low latency in two scenarios: speech and singing voice.  ...  Finally, we explore the transferability of models trained for speech separation in the task of singing voice separation. The demos, code, and weights are available in https://ipcv.github.io/VoViT/  ...  of singing voice separation.  ... 
arXiv:2203.04099v2 fatcat:2q576qc2iraofode6tpzmzxehy

Animating song

Scott A. King, Richard E. Parent
2004 Computer Animation and Virtual Worlds  
Modifications to a text-1 to-audiovisual-speech system have been made to take the extra information of timing and frequency of the lyrics from a MIDI file.  ...  We would like to express gratitude toward Judy Bellingham for sharing her knowledge of singing with us, and to Sui-Ling Ming-Wong and Alexis Angelidis for proofreading the text of this article.  ...  Likewise, we have extended our text-to-audiovisual-speech (TTAVS) [7] system to include singing.  ... 
doi:10.1002/cav.7 fatcat:a3pjtomp3rholioinwahqh4awi

Design and Evaluation of a Real-Time Audio Source Separation Algorithm to Remix Music for Cochlear Implant Users

Sina Tahmasebi, Tom Gajȩcki, Waldo Nogueira
2020 Frontiers in Neuroscience  
voice.  ...  Moreover, the implemented model was optimized to perform real-time source separation.  ...  requiring an even further enhancement of the singing voice.  ... 
doi:10.3389/fnins.2020.00434 pmid:32508564 pmcid:PMC7248365 fatcat:pnshrx74dfb6tem67htdfdou3a

Smartvox. A Web-Based Distributed Media Player As Notation Tool For Choral Practices

Jonathan Bell, Benjamin Matuszewski
2017 Zenodo  
Technically, SmartVox is a distributed web application that delivers audiovisual scores through the performer's mobile devices.  ...  It also enables spatial separation of the performers (cori spezzati ), and speeds up the learning process of unfamiliar musical materials (e.g. microtonal tuning, texts in a foreign language).  ...  In a pedagogical piece composed for this system, the notation purposefully conveyed the same pitch information in four different ways: • Sound frequency: a synthetic voice sings on a given pitch, e.g.,  ... 
doi:10.5281/zenodo.924142 fatcat:36i24i3crbdohahqewblykooxe

Retratando o musicar do bumba meu boi no audiovisual

Luiza Fernandes Coelho
2021 GIS - Gesto Imagem e Som - Revista de Antropologia  
Este artigo apresenta reflexões sobre como a prática do musicar (Small 1998) e o caráter participativo (Turino 2008) do bumba meu boi são traduzidos na linguagem audiovisual (Romero e Villela 2018) nas  ...  O artigo expõe quais musicares dessa manifestação cada produção enfatiza e quais as técnicas utilizadas para traduzi-los para a linguagem audiovisual, realizando também um comparativo entre as técnicas  ...  , who sing in chorus; during the interview, when we only listen to Ana Maria's voice singing; and Ariel and Ana Maria jam together in a performance, when he plays the pandeirão and she sings.  ... 
doi:10.11606/issn.2525-3123.gis.2021.175861 fatcat:habmtkvw5vdjhauxxt7ubeggqe

Four-Month-Olds' Discrimination of Voice Changes in Multimodal Displays as a Function of Discrimination Protocol

Jason S. McCartney, Robin Panneton
2005 Infancy  
The results indicated that 4-month-old infants discriminated voice changes in dynamic face + voice displays depending on the order in which they were viewed during the infant-controlled habituation procedure  ...  studies have found equivocal support for the ability of young infants to discriminate infant-directed (ID) speech information in the presence of auditory-only versus auditory + visual displays (faces + voices  ...  face + female singing display.  ... 
doi:10.1207/s15327078in0702_3 pmid:33430551 fatcat:zaxbpvhuqjftjlrrvporgaqdae
« Previous Showing results 1 — 15 out of 2,888 results