A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation
2011
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through
doi:10.1109/icassp.2011.5946389
dblp:conf/icassp/OzerovFBD11
fatcat:2v7pkdybmvganhwy2k5apazupa