Filters








259 Hits in 6.2 sec

Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help? [article]

Chitralekha Gupta, Emre Yılmaz, Haizhou Li
<span title="2019-10-22">2019</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Automatic lyrics alignment and transcription in polyphonic music are challenging tasks because the singing vocals are corrupted by the background music.  ...  In this work, we propose to learn music genre-specific characteristics to train polyphonic acoustic models.  ...  Singing vocal extraction vs. polyphonic audio Earlier approaches to lyrics transcription have used acoustic models that were trained on solo-singing audio.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1909.10200v2">arXiv:1909.10200v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6sc6dywp6jcvzhf35pyyluly3a">fatcat:6sc6dywp6jcvzhf35pyyluly3a</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200916130720/https://arxiv.org/pdf/1909.10200v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/c5/12/c51237bc109ddc6d0b5f5bd3db224e70fd341b73.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1909.10200v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Music-Robust Automatic Lyrics Transcription Of Polyphonic Music

Xiaoxue Gao, Chitralekha Gupta, Haizhou Li
<span title="2022-06-07">2022</span> <i title="Zenodo"> Zenodo </i> &nbsp;
Lyrics transcription of polyphonic music is challenging because singing vocals are corrupted by the background music.  ...  We show that these two sets of features complement each other, and their combination performs better than when they are used alone, thus improving the robustness of the acoustic model to the background  ...  Rather than directly applying a solo-singing acoustic model to polyphonic data, polyphonic audio adaptation [22] techniques are used to adapt a model trained on a large amount of solo singing data with  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.6573304">doi:10.5281/zenodo.6573304</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ukrvgk2ebnfqdd4ouvkww5omnu">fatcat:ukrvgk2ebnfqdd4ouvkww5omnu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220525033021/https://zenodo.org/record/6573305/files/42.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b0/09/b00954178eb1937d187c7e3af3ad6f51290d9196.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.6573304"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Automatic Recognition of Lyrics in Singing

Annamaria Mesaros, Tuomas Virtanen
<span title="">2010</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tzakietxejgppjzsrojed7bkke" style="color: black;">EURASIP Journal on Audio, Speech, and Music Processing</a> </i> &nbsp;
The recognizer is used to align textual lyrics to vocals in polyphonic music, obtaining an average error of 0.94 seconds for line-level alignment.  ...  The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music.  ...  Section 4 presents two applications: automatic alignment of audio and lyrics in polyphonic music and a small-scale query-by-singing application.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2010/546047">doi:10.1155/2010/546047</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/64uir6egxzgkjhdnc3gwn4qnbe">fatcat:64uir6egxzgkjhdnc3gwn4qnbe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809154856/https://asmp-eurasipjournals.springeropen.com/track/pdf/10.1155/2010/546047?site=asmp.eurasipjournals.springeropen.com" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/52/1e/521efaa1bec48d3f510908d3bfb55a9083cd3184.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2010/546047"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> hindawi.com </button> </a>

Automatic Recognition of Lyrics in Singing

Annamaria Mesaros, Tuomas Virtanen
<span title="">2010</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tzakietxejgppjzsrojed7bkke" style="color: black;">EURASIP Journal on Audio, Speech, and Music Processing</a> </i> &nbsp;
The recognizer is used to align textual lyrics to vocals in polyphonic music, obtaining an average error of 0.94 seconds for line-level alignment.  ...  The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music.  ...  Section 4 presents two applications: automatic alignment of audio and lyrics in polyphonic music and a small-scale query-by-singing application.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/1687-4722-2010-546047">doi:10.1186/1687-4722-2010-546047</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/lksnvnccivd7ziqyoe3ld2u5lm">fatcat:lksnvnccivd7ziqyoe3ld2u5lm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809154856/https://asmp-eurasipjournals.springeropen.com/track/pdf/10.1155/2010/546047?site=asmp.eurasipjournals.springeropen.com" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/52/1e/521efaa1bec48d3f510908d3bfb55a9083cd3184.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/1687-4722-2010-546047"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> springer.com </button> </a>

Genre-conditioned Acoustic Models for Automatic Lyrics Transcription of Polyphonic Music [article]

Xiaoxue Gao, Chitralekha Gupta, Haizhou Li
<span title="2022-04-07">2022</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In this work, we propose to transcribe the lyrics of polyphonic music using a novel genre-conditioned network.  ...  Lyrics transcription of polyphonic music is challenging not only because the singing vocals are corrupted by the background music, but also because the background music and the singing style vary across  ...  Another way is to use acoustic model trained on clean singing vocals, and at the time of inference, apply source separation technique to extract singing vocal from the input polyphonic song to transcribe  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2204.03307v1">arXiv:2204.03307v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/rf3emyqtfjeuvkdnr24u37eiwe">fatcat:rf3emyqtfjeuvkdnr24u37eiwe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220419184521/https://arxiv.org/pdf/2204.03307v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f8/4c/f84c821065692a4c94edad59838342bdc3246042.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2204.03307v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

End-to-end lyrics Recognition with Voice to Singing Style Transfer [article]

Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi
<span title="2021-02-17">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Automatic transcription of monophonic/polyphonic music is a challenging task due to the lack of availability of large amounts of transcribed data.  ...  The V2S model based style transfer can generate good quality singing voice thereby enabling the conversion of large corpora of natural speech to singing voice that is useful in building an E2E lyrics transcription  ...  [3] attempted an adaptation of the models trained from solo music to polyphonic music for lyrics alignment.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2102.08575v1">arXiv:2102.08575v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ow634qefczfnnctl7czmexylbm">fatcat:ow634qefczfnnctl7czmexylbm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210219011458/https://arxiv.org/pdf/2102.08575v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/8d/69/8d69b3ecc2dee85fc12b30afd31ed2bf1e1e4440.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2102.08575v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Singing information processing based on singing voice modeling

Masataka Goto, Takeshi Saitou, Tomoyasu Nakano, Hiromasa Fujihara
<span title="">2010</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2010 IEEE International Conference on Acoustics, Speech and Signal Processing</a> </i> &nbsp;
Common signal processing techniques for modeling singing voices that are used in these systems, such as techniques for extracting the vocal melody from polyphonic music recordings and modeling the lyrics  ...  We then introduce music information retrieval systems based on similarity of vocal melody timbre and vocal percussion, and singing synthesis systems.  ...  Singer ID: Singer identification for polyphonic music recordings Our Singer ID system automatically identifies the name of the singer who sang the input song in the form of polyphonic musical audio signals  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2010.5495212">doi:10.1109/icassp.2010.5495212</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/GotoSNF10.html">dblp:conf/icassp/GotoSNF10</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3xlmcazibngltmkxlw2kv5c35m">fatcat:3xlmcazibngltmkxlw2kv5c35m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20121211045054/http://staff.aist.go.jp/m.goto/PAPER/ICASSP2010goto.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4c/0e/4c0e40a76b3552292618e5d6a157b320ad1c45a3.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2010.5495212"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

On The Use Of Note Onsets For Improved Lyrics-To-Audio Alignment In Turkish Makam Music

Georgi Dzhambazov, Ajay Srinivasamurthy, Sertan Sentürk, Xavier Serra
<span title="2016-08-07">2016</span> <i title="Zenodo"> Zenodo </i> &nbsp;
The goal of automatic lyrics-to-audio alignment is to generate a temporal relationship between lyrics and recorded singing.  ...  "On the use of note onsets for improved lyrics-to-audio alignment in Turkish Makam music", 17th International Society for Music Information Retrieval Conference, 2016. singing tradition [12] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.1415987">doi:10.5281/zenodo.1415987</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/i7qivzgv7jblhlk2jskjsvhdhq">fatcat:i7qivzgv7jblhlk2jskjsvhdhq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201225115249/https://zenodo.org/record/1415988/files/DzhambazovSSS16.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/6f/6e/6f6e39048294da950e2458761bcb2ba305a24c95.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.1415987"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Recognition of phonemes and words in singing

Annamaria Mesaros, Tuomas Virtanen
<span title="">2010</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2010 IEEE International Conference on Acoustics, Speech and Signal Processing</a> </i> &nbsp;
The word-level language model is estimated from a textual lyrics database. In the recognition we use a hidden Markov model based phonetic recognizer adapted to singing voice.  ...  On clean singing the phoneme recognition accuracies varied from 20% (no language model) to 39% (bigram) and on polyphonic music from 6% (no language model) to 20% (bigram).  ...  Authors of [4] use a speech recognizer adapted to singing voice to align lyrics with segregated vocals for japanese pop songs, with a language model containing the sequence of the vowels in the lyrics  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2010.5495585">doi:10.1109/icassp.2010.5495585</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/MesarosV10.html">dblp:conf/icassp/MesarosV10</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/oaxjgz3upzbfljmupuz5pxqxjq">fatcat:oaxjgz3upzbfljmupuz5pxqxjq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190713220226/http://www.cs.tut.fi:80/~mesaros/pubs/singrec.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/45/f7/45f70b1d5262b4979932842876180275e97f37de.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2010.5495585"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription [article]

Emir Demirel, Sven Ahlbäck, Simon Dixon
<span title="2021-08-05">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
First, we suggest using recordings from both monophonic and polyphonic domains during training the acoustic model.  ...  This paper makes several contributions to automatic lyrics transcription (ALT) research.  ...  composing music, audio/video/music score captioning and editing, lyrics alignment, music catalogue creation, etc.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.02625v1">arXiv:2108.02625v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/s2idcjt3bbhkdka27bsbefspty">fatcat:s2idcjt3bbhkdka27bsbefspty</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210807035550/https://arxiv.org/pdf/2108.02625v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/91/2c/912c7e3dea3066cae958734df42d46a82c11ae74.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.02625v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Three techniques for improving automatic synchronization between music and lyrics: Fricative detection, filler model, and novel feature vectors for vocal activity detection

Hiromasa Fujihara, Masataka Goto
<span title="">2008</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2omreisfsje33bgvx3orrdifre" style="color: black;">Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing</a> </i> &nbsp;
Three techniques are described that improve a previously developed system for automatically synchronizing lyrics with musical audio signals.  ...  Although this system achieves state-of-the-art accuracy by extracting vocal vowels from polyphonic sound mixtures and using forced alignment between those vowels and a phoneme network of the lyrics, there  ...  Before the forced alignment is executed, each phone model (HMM) is adapted to singing voices in the input audio signals by using the MLLR and MAP adaptation techniques.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2008.4517548">doi:10.1109/icassp.2008.4517548</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/FujiharaG08.html">dblp:conf/icassp/FujiharaG08</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/f7pqtd624fb23hqdhrkbmmmc24">fatcat:f7pqtd624fb23hqdhrkbmmmc24</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20121211031333/http://staff.aist.go.jp/m.goto/PAPER/ICASSP2008fujihara.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/43/0c/430ccf7b9bb6dc618f731fab619b63e1c72e4486.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2008.4517548"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription

Emir Demirel, Sven Ahlbäck, Simon Dixon
<span title="2021-11-07">2021</span> <i title="Zenodo"> Zenodo </i> &nbsp;
First, we suggest using recordings from both monophonic and polyphonic domains during training the acoustic model.  ...  This paper makes several contributions to automatic lyrics transcription (ALT) research.  ...  composing music, audio/video/music score captioning and editing, lyrics alignment, music catalogue creation, etc.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.5624642">doi:10.5281/zenodo.5624642</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zcrgymzdybctzgv23e4mcvsk3u">fatcat:zcrgymzdybctzgv23e4mcvsk3u</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211101232451/https://zenodo.org/record/5624643/files/000018.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a3/76/a3769044006176ec45b07aa8266ffac2e3e54c61.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.5624642"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics

Hiromasa Fujihara, Masataka Goto, Jun Ogata, Hiroshi G. Okuno
<span title="">2011</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/aznf273kcvcbfjcdeghr3xjd6i" style="color: black;">IEEE Journal on Selected Topics in Signal Processing</a> </i> &nbsp;
This paper describes a system that can automatically synchronize polyphonic musical audio signals with their corresponding lyrics.  ...  for constructing robust phoneme networks, a method for detecting fricative sounds, and a method for adapting a speech-recognizer phone model to segregated vocal signals.  ...  We proposed a method for adapting a phone model for speech to separated vocal signals. This method was useful for music and lyric alignment as well as for recognizing lyrics in polyphonic music.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/jstsp.2011.2159577">doi:10.1109/jstsp.2011.2159577</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/o3zzphl6o5dzfllf7ujuckms7y">fatcat:o3zzphl6o5dzfllf7ujuckms7y</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20121211051112/http://staff.aist.go.jp/m.goto/PAPER/IEEEJSTSP201110fujihara.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/32/23/32235c9622adf5c4ab5a52789a0b6ba9253ac973.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/jstsp.2011.2159577"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Adaptation Of A Speech Recognizer For Singing Voice

Annamaria Mesaros
<span title="2009-08-24">2009</span> <i title="Zenodo"> Zenodo </i> &nbsp;
Audio-to-lyrics alignment In addition to phooneme recognition, the previously developed application, audio to lyrics alignment [4] can be used to test the adapted models.  ...  Our previous work tackles the automatic alignment of polyphonic music with textual lyrics in English, by using a speech recognizer [4] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.41584">doi:10.5281/zenodo.41584</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ny2cjl6shrgrfd6jmlipsf2do4">fatcat:ny2cjl6shrgrfd6jmlipsf2do4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170814003541/http://www.eurasip.org/Proceedings/Eusipco/Eusipco2009/contents/papers/1569191780.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/79/bd/79bdbf72121202a485a52e9a6d2ad5f0342457d8.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.41584"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Knowledge-Based Probabilistic Modeling For Tracking Lyrics In Music Audio Signals

Georgi Dzhambazov, Xavier Serra
<span title="2017-06-28">2017</span> <i title="Zenodo"> Zenodo </i> &nbsp;
Using the proposed models sung lyrics are automatically aligned to written lyrics on datasets from Ottoman Turkish makam and Beijing opera, whereby principles, specific for these music traditions are considered  ...  We consider not only the low-level acoustic characteristics, representing the timbre of the sung phonemes, but also higher-level music knowledge, that is complementary to lyrics.  ...  Acknowledgements This motivated us to take the opportunity to consider the deep MLP model the authors trained from amateur singers in their subsequent work - (Kruspe, 2016) .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.841980">doi:10.5281/zenodo.841980</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/tohf6dcvobhe3ei77nvp3wg3ba">fatcat:tohf6dcvobhe3ei77nvp3wg3ba</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201225003644/https://zenodo.org/record/841980/files/PhDThesis_Georgi_Knowledge_based_Lyrics_Tracking.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ed/bd/edbd14fe7ef96509620efa7be62cc3dc11434a5f.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.841980"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 259 results