AUDIO SIGNAL REPRESENTATIONS FOR FACTORIZATION IN THE SPARSE DOMAIN

Manuel Moussallam, Laurent Daudet, Gael Richard
2011 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
In this paper, a new class of audio representations is introduced, together with a corresponding fast decomposition algorithm. The main feature of these representations is that they are both sparse and approximately shift-invariant, which allows similarity search in a sparse domain. The common sparse support of detected similar patterns is then used to factorize their representations. The potential of this method for simultaneous structural analysis and compressing tasks is illustrated by preliminary experiments on simple musical data.
doi:10.1109/icassp.2011.5946453 dblp:conf/icassp/MoussallamDR11 fatcat:htgzt2obh5fm3fnltzsts4xryi