Extracting singing voice from music recordings by cascading audio decomposition techniques

Jonathan Driedger, Meinard Muller
<span title="">2015</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</a> </i> &nbsp;
The problem of extracting singing voice from music recordings has received increasing research interest in recent years. Many proposed decomposition techniques are based on one of the following two strategies. The first approach is to directly decompose a given music recording into one component for the singing voice and one for the accompaniment by exploiting knowledge about specific characteristics of singing voice. Procedures following the second approach disassemble the recording into a
more &raquo; ... e set of fine-grained components, which are classified and reassembled afterwards to yield the desired source estimates. In this paper, we propose a novel approach that combines the strengths of both strategies. We first apply different audio decomposition techniques in a cascaded fashion to disassemble the music recording into a set of mid-level components. This decomposition is fine enough to model various characteristics of singing voice, but coarse enough to keep an explicit semantic meaning of the components. These properties allow us to directly reassemble the singing voice and the accompaniment from the components. Our objective and subjective evaluations show that this strategy can compete with state-of-the-art singing voice separation algorithms and yields perceptually appealing results.
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2015.7177945">doi:10.1109/icassp.2015.7177945</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/DriedgerM15.html">dblp:conf/icassp/DriedgerM15</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/awbmpz323bcr5gkd6h35hir4vm">fatcat:awbmpz323bcr5gkd6h35hir4vm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170410041101/https://www.audiolabs-erlangen.de/content/05-fau/professor/00-mueller/03-publications/2015_DriedgerMueller_SVECascadedDecomposition_ICASSP.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/15/5d/155d2a44cb1bce274702837e113373fa8769c71d.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2015.7177945"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>