Cell-Utes and Flutter-Tongued Cats: Sound Morphing Using Loris and the Reassigned Bandwidth-Enhanced Model

Kelly Fitz, Lippold Haken, Susanne Lefvert, Corbin Champion, Mike O'Donnell
2003 Computer Music Journal  
The reassigned bandwidth-enhanced additive sound model is a high-delity sound representation that allows manipulations and transformations to be applied to a great variety of sounds, including noisy and inharmonic sounds. Combining sinusoidal and noise energy in a homogeneous representation, the reassigned bandwidth-enhanced model is ideally suited to sound morphing and is implemented in the open-source software library Loris. This article presents methods for using Loris and the reassigned
more » ... width-enhanced additive model to achieve high-delity sound representations and manipulations, and it introduces software tools that allow programmers (in C/C"and various scripting languages) and non-programmers to use the sound modeling and manipulation capabilities of the Loris package. The reassigned bandwidth-enhanced additive model is similar in spirit to traditional sinusoidal models (McAulay and Quatieri 1986; Serra and Smith 1990; Fitz and Haken 1996) in that a waveform is modeled as a collection of components, called partials, having time-varying amplitude and frequency envelopes. Our partials are not strictly sinusoidal, however. We employ a technique of bandwidth enhancement to combine sinusoidal energy and noise energy into a single partial having time-varying frequency, amplitude, and noisiness (or bandwidth) parameters (Fitz, Haken, and Christensen 2000a). The bandwidth envelope allows us to de ne a single component type that can be used to manipulate both sinusoidal and noisy parts of sound in an intuitive way. The encoding of noise associated with a bandwidth-enhanced partial is robust under time dilation and other model-domain transformations, and it is independent of other partials in the representation. We use the method of reassignment (Auger and Flandrin 1995) to improve the time and frequency estimates used to de ne our partial parameter envelopes. The breakpoints for the partial parameter envelopes are obtained by following ridges on a reassigned time-frequency surface. Our algorithm shares with traditional sinusoidal methods the notion of temporally connected partial parameter estimates, but by contrast, our estimates are non-uniformly distributed in both time and frequency. This model yields greater resolution in time and frequency than is possible using conventional additive techniques and preserves the temporal envelope of transient signals, even in modi ed reconstruction (Fitz, Haken, and Christensen 2000b) .
doi:10.1162/014892603322482529 fatcat:x6d4pl5k2fgszema5vedux7ejq