<-- Back to Previous Page TOC Next Chapter -->

   

Chapter 3: The Frequency Domain

Section 3.6: Some Alternatives to the FFT

We know that in previous sections we’ve made it sound like the FFT is the only game in town, but that’s not entirely true. Historically,the FFT was the first, most important, and most widely used algorithm for frequency domain analysis. Musicians, especially, have taken a liking to it because there are some well-known references on its implementation, and computer musicians especially have gained a lot of experience in its nuances. It’s fairly simple to program, and there are a lot of existing software programs to use if you get lazy. Also, it works pretty well (emphasis on the word "pretty").

The FFT breaks up a sound into sinusoids, but there may be reasons why you’d like to break a sound into other sorts of basic sounds. These alternate transforms can work better, for some purposes, than the FFT. (A transform is just something that takes a list of the sample values and turns the values into a new list of numbers that describes a possible new way to add up different basic sounds.)

For example, wavelet transforms (sometimes called dynamic base transforms) can modify the resolution for different frequency ranges, unlike the FFT, which has a constant bandwidth (and thus is, by definition, less sensitive to lower frequencies—the important ones— than to higher ones!). Wavelets also use a variety of different analysis waveforms (as opposed to the FFT, which only uses sinusoids) to get a better representation of a signal. Unfortunately, wavelet transforms are still a bit uncommon in computer software, since they are, in general, harder to implement than FFTs. One of the big problems is deciding which wavelet to use, and in what frequency bandwidth. Often, that decision-making process can be more important (and time-consuming) than the actual transform!

Wavelets are a pretty hot topic though, so we’ll probably be seeing some wavelet-based techniques emerge in the near future.

Figure 3.34 Some common wavelet analysis waveforms. Each of these is a prototype, or "mother wavelet," that provides a basic template for all the related analysis waveforms.

A wavelet analysis will start with one of these waveforms and break up the sound into a sum of translated and stretched (dilated) versions of these.

McAulay-Quatieri (MQ) Analysis

Another interesting approach to a more organized, information-rich set of transforms is the extended McAulay-Quatieri (MQ) analysis algorithm. This is used in the popular Macintosh program Lemur Pro, written by the brilliant computer music researcher Kelly Fitz.

MQ analysis works a lot like normal FFTs—it tries to figure out how a sound can be re-created using a number of sine waves. However, Lemur Pro is an enhancement of more traditional FFT-based software in that it uses the resulting sine waves to extract frequency tracks from a sound. These tracks attempt to follow particular components of the sound as they change over time. That is, MQ not only represents the amplitude trajectories of partials, but also tries to describe the ways that spectral components change in frequency over time. This is a fundamentally different way of describing spectra, and a powerful one.

MQ is an excellent idea, as it allows the analyst a much higher level of data representation and provides a more perceptually significant view of the sound. A number of more advanced FFT-based programs also implement some form of frequency tracking in order to improve the results of the analysis, similar to that of MQ analysis.

For example, even though a sine wave might sweep from 1 kHz to 2 kHz, it would still be contained in one MQ track (rather than fading in and out of FFT bins, forming what are called lobes).

An MQ-type analysis is often more useful for sound work than regular FFT analysis, since it has sonic cognition ideas built into it. For example, the ear tends to follow frequencies in time, much like an MQ analysis, and in our perception loud frequencies tend to mask neighboring quieter frequencies (as do MQ analyses). These kinds of sophisticated analysis techniques, where frequency and amplitude are isolated, suggest purely graphical manipulation of sounds, and we’ll show some examples of that in Section 5.7.

Figure 3.35  Frequency tracks in Lemur Pro. The information looks much like one of our old sonograms, but in fact it is the result of a great deal of high-level processing of FFT-type data.

Figure 3.36  MQ plot of 0'00" to 5'49" of Movement II: from the electro-acoustic work "Sud" by the great French composer and computer music pioneer Jean-Claude Risset. Photo by David Hirst, an innovative composer and computer music researcher from Australia.

<-- Back to Previous Page Next Chapter -->