Lyrics Recognition From A Singing Voice Based On Finite State Automaton For Music Information Retrieval

Toru Hosoya, Motoyuki Suzuki, Akinori Ito, Shozo Makino
2005 Zenodo  
LYRICS RECOGNITION BASED ON A FINITE STATE AUTOMATON Introduction A large vocabulary continuous speech recognition (LVCSR) system performs speech recognition using two kinds of models, an acoustic model  ...  To achieve this, we used a finite state automaton (FSA) that accepts any subsequence of the lyrics in the database as a language model for lyrics recognition.  ... 
doi:10.5281/zenodo.1417854 fatcat:t4qp3onxp5durko2ijpefldz44

Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino
2006 EURASIP Journal on Advances in Signal Processing  
A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer.  ...  Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed.  ...  LYRICS RECOGNITION BASED ON A FINITE STATE AUTOMATON Introduction An LVCSR system performs speech recognition using two kinds of models, an acoustic model and a language model.  ... 
doi:10.1155/2007/38727 fatcat:yw4fsuexffdv5am42x3y75x2cm

Music Information Retrieval From A Singing Voice Based On Verification Of Recognized Hypotheses

Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino
2006 Zenodo  
This method uses a finite state automaton as a language model. It gave 77.4% word accuracy, and the retrieval accuracy given by the system achieved 85.9%.  ...  In order to develop an MIR system that uses melody and lyrics information, the lyrics recognition method from a singing voice has been proposed [5].  ... 
doi:10.5281/zenodo.1414785 fatcat:tntomb33ivhm5ov5giqkfrrlua

Recognition of phonemes and words in singing

Annamaria Mesaros, Tuomas Virtanen
2010 2010 IEEE International Conference on Acoustics, Speech and Signal Processing  
The word-level language model is estimated from a textual lyrics database. In the recognition we use a hidden Markov model based phonetic recognizer adapted to singing voice.  ...  We study the use of the recognition results in a query-by-singing application. Using the recognized words, we retrieve the songs by searching for the text in a text lyrics database.  ...  Query-by-singing retrieval accuracy QUERY-BY-SINGING BASED ON WORD RECOGNITION In query-by-humming/singing, the aim is to identify a piece of music from its melody and lyrics.  ... 
doi:10.1109/icassp.2010.5495585 dblp:conf/icassp/MesarosV10 fatcat:oaxjgz3upzbfljmupuz5pxqxjq

Automatic Recognition of Lyrics in Singing

Annamaria Mesaros, Tuomas Virtanen
2010 EURASIP Journal on Audio, Speech, and Music Processing  
A query-by-singing retrieval application based on the recognized words is also constructed; in 57% of the cases, the first retrieved song is the correct one.  ...  The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music.  ...  The lyrics recognition grammar was a finite state automaton constructed from the lyrics in the queried database.  ... 
doi:10.1186/1687-4722-2010-546047 fatcat:lksnvnccivd7ziqyoe3ld2u5lm

Automatic Recognition of Lyrics in Singing

Annamaria Mesaros, Tuomas Virtanen
2010 EURASIP Journal on Audio, Speech, and Music Processing  
A query-by-singing retrieval application based on the recognized words is also constructed; in 57% of the cases, the first retrieved song is the correct one.  ...  The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music.  ...  The lyrics recognition grammar was a finite state automaton constructed from the lyrics in the queried database.  ... 
doi:10.1155/2010/546047 fatcat:64uir6egxzgkjhdnc3gwn4qnbe

Transcribing Lyrics From Commercial Song Audio: The First Step Towards Singing Content Processing [article]

Che-Ping Tsai, Yi-Lin Tuan, Lin-shan Lee
2018 arXiv   pre-print
The various problems in song audio, for example the significantly changing phone duration over highly flexible pitch contours, make the recognition of lyrics from song audio much more difficult.  ...  We collected music-removed version of English songs directly from commercial singing content.  ...  Having the language model learned from a data set of lyrics is definitely helpful [16, 18]. Hosoya et al. [17] achieved this with finite state automaton. Sasou et al.  ... 
arXiv:1804.05306v1 fatcat:bncfvhfnw5fsjbgzmbksomo4am

Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help? [article]

Chitralekha Gupta, Emre Yılmaz, Haizhou Li
2019 arXiv   pre-print
music affects lyrics intelligibility of singing vocals in a music piece.  ...  We first compare several automatic speech recognition pipelines for the application of lyrics transcription.  ...  [4] described lyrics recognition grammar using a finite state automaton (FSA) built from the lyrics in the queried database, so as to exploit the linguistic constraints in lyrics such as rhyming patterns  ... 
arXiv:1909.10200v2 fatcat:6sc6dywp6jcvzhf35pyyluly3a

Music information retrieval

J. Stephen Downie
2005 Annual Review of Information Science and Technology  
Welcome friends and colleagues to the 2nd Annual International Symposium on Music Information Retrieval, ISMIR 2001.  ...  Response to our Call for Papers was remarkable. Selecting the twenty papers for presentation (out of 40 submissions) and the eighteen posters for exhibition was no easy task.  ...  ACKNOWLEDGEMENTS I would like to thank Tim Crawford of Kings College London and Don Byrd of the University of Massachusetts at Amherst for their assistance in working on this topic and their patience in  ... 
doi:10.1002/aris.1440370108 fatcat:5v36lrlqbjfi5fkuxw3mzjyhhe

Polyphonic music retrieval

Shyamala Doraisamy
2005 SIGIR Forum  
For the retrieval of these 'overlaying' musical words, i.e., when more than one word can assume the same within-document position, a new proximity-based operator and a ranking function is proposed.  ...  This Music Information Retrieval (MIR) study investigates the use of n-grams and textual Information Retrieval (IR) approaches for the retrieval and access of polyphonic music data.  ...  In the studies surveyed, the error models were based on a definition of humming as singing with the syllable ta, da or la, and not whistling or singing with syllables derived from lyrics.  ... 
doi:10.1145/1067268.1067289 fatcat:6kh6aflmnjgvlnr2zwtpn32oxi

A virtual auditory environment for investigating the auditory signal processing of realistic sounds

Sylvain Favrot, Jörg M. Buchholz
2008 Journal of the Acoustical Society of America  
voices) in available music recordings and its applications to singer identification, singer similarity, and lyrics synchronization.  ...  Therefore, in order to separate a singer voice from background music, we suggest a source-filter model for the singer signal, keeping the same models as used by Benaroya for the background music signal  ...  A cellular automaton for urban traffic noise.  ... 
doi:10.1121/1.2936003 fatcat:ontu6yamdvgbnooet5fe36fm74

Analysis of room transfer function and reverberant signal statistics

Eleftheria Georganti, John Mourjopoulos, Finn Jacobsen
2008 Journal of the Acoustical Society of America  
voices) in available music recordings and its applications to singer identification, singer similarity, and lyrics synchronization.  ...  Therefore, in order to separate a singer voice from background music, we suggest a source-filter model for the singer signal, keeping the same models as used by Benaroya for the background music signal  ...  A cellular automaton for urban traffic noise.  ... 
doi:10.1121/1.2935346 fatcat:lxgnqr6tozajhge3ydet3xomam

Temporal suppression and augmentation of click-evoked otoacoustic emissions

Sarah Verhulst, James M. Harte, Torsten Dau
2008 Hearing Research  
The scanning and voxellisation of complex 3D objects for incorporation within finite difference time domain based acoustic prediction.  ...  Some new results of a beam tracing approach based on the uncertainty principle. Uwe M.  ...  voices) in available music recordings and its applications to singer identification, singer similarity, and lyrics synchronization.  ... 
doi:10.1016/j.heares.2008.09.008 pmid:18926894 fatcat:bijdlpxpzfcsrjeefayvvu4pl4

Temporal suppression and augmentation of click‐evoked otoacoustic emissions

Sarah Verhulst, James M. Harte, Torsten Dau
2008 Journal of the Acoustical Society of America  
The scanning and voxellisation of complex 3D objects for incorporation within finite difference time domain based acoustic prediction.  ...  Some new results of a beam tracing approach based on the uncertainty principle. Uwe M.  ...  voices) in available music recordings and its applications to singer identification, singer similarity, and lyrics synchronization.  ... 
doi:10.1121/1.2935694 fatcat:z6yidzqqendznbmrczb3woaiv4

Across‐frequency processes involved in auditory detection of coloration

Jörg M. Buchholz, Paris Kerketsos
2008 Journal of the Acoustical Society of America  
The scanning and voxellisation of complex 3D objects for incorporation within finite difference time domain based acoustic prediction.  ...  Some new results of a beam tracing approach based on the uncertainty principle. Uwe M.  ...  voices) in available music recordings and its applications to singer identification, singer similarity, and lyrics synchronization.  ... 
doi:10.1121/1.2935741 fatcat:acaporn5qvgb5coauovqkbv6lq