FFT for Producers: How to Read a Spectrum Analyzer | VGP Studio

The screenshot comparison trap

You pull up a spectrum analyzer on your master bus. You load a reference track. You stare at both curves and think: if I just EQ my mix until the shapes match, it'll sound professional. It won't. I've done this. Spent 45 minutes sculpting a frequency curve that looked almost identical to a Drake reference, and the result sounded like someone draped a wet towel over a speaker. The shapes matched. The music didn't. This is the core misunderstanding with spectrum analyzers. They show you real data, but the data is not what you think it is. To use them well, you need to understand what's actually happening under the hood. That means understanding FFT.

What FFT is (without the textbook voice)

Sound in your DAW is a list of numbers. Each number is a sample, a snapshot of air pressure at one instant. That's the time domain: amplitude changing over time. Your waveform view. Familiar territory. But sometimes you want to ask a different question. Not "what's happening at this instant" but "how much energy is sitting at each frequency?" That's the frequency domain. And to get from time to frequency, your analyzer runs an algorithm called the Fast Fourier Transform. The FFT is just a clever, optimized version of the Discrete Fourier Transform (DFT). The DFT formula looks like this:

X[k] = sum_{n=0}^{N-1} x[n] \cdot e^{-j 2\pi k n / N}

In human terms: take your block of N audio samples, multiply each sample by a spinning complex number at frequency k, and add them all up. The result tells you how much energy lives at that frequency. Do this for every k from 0 to N-1, and you get a full frequency snapshot. Each frequency "bin" in the analyzer corresponds to:

ext{frequency} = k \times \frac{f_s}{N}

So if your sample rate is 44,100 Hz and your FFT size is 4096, each bin covers about 10.77 Hz. That's why the low end looks like fat blurry blobs on a small FFT, and why cranking up the FFT size makes low frequencies sharper but makes everything respond slower. There's a real tradeoff here.

The resolution tradeoff that nobody explains well

Bigger FFT size = better frequency resolution. You can distinguish between 100 Hz and 110 Hz. But bigger FFT size also means longer time window. The analyzer needs more samples before it can give you an answer, so fast transients get smeared. Smaller FFT size = faster response. Snare hits show up as quick spikes. But frequency resolution drops. 100 Hz and 200 Hz might land in the same bin.

Audio Specification Matrix SCROLL HORIZONTALLY →

FFT size	Freq resolution (at 44.1 kHz)	Time window
1024	~43 Hz per bin	~23 ms
2048	~21.5 Hz per bin	~46 ms
4096	~10.8 Hz per bin	~93 ms
8192	~5.4 Hz per bin	~186 ms

This is not a bug. It's a fundamental property of how frequency analysis works. You literally cannot have perfect time resolution and perfect frequency resolution at the same time. Heisenberg's uncertainty principle shows up in audio, which is kind of wild when you think about it. Most spectrum analyzer plugins default to something around 4096. That's a decent middle ground. But if you're trying to identify a specific resonant frequency in a bass guitar, bump it up. If you're watching transient behavior, drop it down.

Windowing: the thing you skip past in the settings

When the FFT grabs a block of samples, it assumes that block repeats forever. If the start and end of your block don't line up smoothly (they almost never do), you get spectral leakage. Energy smears across bins that shouldn't have any. Windowing fixes this by fading the edges of each block to zero before the FFT runs. Different window shapes make different compromises:

Hann (sometimes called Hanning): smooth taper, good general purpose, slightly wider frequency peaks

Blackman-Harris: very steep taper, less leakage, but even wider peaks

Rectangular (no window): sharpest peaks, worst leakage

Most analyzers default to Hann, and honestly that's fine for 90% of production work. I've never changed a window function and had it meaningfully alter a mix decision. But knowing why it exists helps you understand why the analyzer sometimes shows energy where you don't expect it.

The 5-minute DAW experiment

Do this right now. It takes five minutes and will change how you read analyzers.

1 Open a new session. Load a simple sine wave generator on a track. Set it to 1 kHz.

2 Put a spectrum analyzer on the same track. Note the single spike at 1 kHz. Clean. Obvious.

3 Now add a second sine wave at 1.05 kHz (50 Hz higher). Watch what happens to the analyzer display. With a small FFT size, you might see one wide bump. Increase the FFT size until two distinct peaks appear.

4 Replace the sine waves with a full mix. Notice how the analyzer becomes a dense, constantly shifting mess. That "mess" is your music. All frequencies present, all the time, overlapping and interacting.

5 Now play your mix and a reference track side by side (gain-matched). The overall shapes will be different. That's okay. Different songs have different frequency distributions. A bright pop mix and a warm jazz record will never look the same, and they shouldn't.

The point: the analyzer is honest about energy distribution. It is completely silent about whether the music is good. Two mixes can have identical spectral shapes and one sounds professional while the other sounds lifeless.

The spectrogram lie (well, not a lie, but close)

Spectrograms are the colorful waterfall displays. Time on one axis, frequency on the other, color representing amplitude. They look incredible. Very scientific. Very convincing. But they suffer from the same resolution tradeoff as the standard analyzer, just painted prettier. And the color mapping is arbitrary. One plugin's "yellow" might be another plugin's "green." The visual impression changes depending on the color palette, the dynamic range of the display, and the update rate. I'm not saying spectrograms are useless. They're great for spotting things like:

Constant hum at a specific frequency (horizontal line that never moves)

Resonances ringing after a transient

Frequency gaps where content drops out

But they're a diagnostic tool, not a creative one. If you're making EQ decisions because the spectrogram looks uneven, you're doing it backwards.

The measurement workflow that actually works

Here's the process I use, stolen partly from Bob Katz and partly from years of getting it wrong: 1. Listen first. Before you open any analyzer, play the mix and write down what bothers you. "Vocals feel buried." "Low end is boomy." "Hi-hats are harsh." Actual words about the listening experience. 2. Form a hypothesis. "I think there's a buildup around 200-300 Hz that's masking the vocal." This is your guess. It might be wrong. 3. Measure. Now open the analyzer. Look at the region you suspect. Is there actually a bump there? Compare with a reference. Does the reference have less energy in that range? 4. Make a change. Cut 3 dB at 250 Hz with a moderate Q. Or whatever your hypothesis suggests. 5. Level-match. This is the step everyone skips. Any EQ cut makes the overall level quieter, and quieter always sounds worse to our brains (Fletcher-Munson curves, look them up). Compensate the gain so the loudness is the same before and after. 6. Decide with your ears. A/B the change. Does it actually sound better, or did you just make the analyzer look neater? If it doesn't sound better, undo it. The graph doesn't matter.

The mistake that wastes the most time

Trying to make your mix's spectrum match a reference track's spectrum. I already mentioned this, but it's worth repeating because it's that common. Here's why it fails: a spectrum analyzer shows the average energy distribution of a complete mix. That includes the arrangement, the instrumentation, the recording quality, the performance dynamics, everything. Two songs in the same genre with the same mastering engineer will still have different spectra because they're different songs with different notes and different performances. When you EQ your master to match a reference curve, you're not fixing your mix. You're warping it to fit a statistical profile that was never meant to be a target. You end up boosting frequencies where your arrangement has natural gaps (there's nothing wrong with gaps) and cutting frequencies where your song has intentional energy. Use references for ballpark sanity checks. "My mix has way more energy below 50 Hz than anything else in this genre" is useful information. "My mix has 1.3 dB less energy at 2.4 kHz than the reference" is noise.

What the analyzer can't show you

Timing. Groove. Whether the snare hits with the right attitude. Whether the vocal delivery is convincing. Whether the arrangement builds tension. Whether the bass note choices are musically interesting. It also can't show you phase relationships, stereo width (not directly, anyway), or dynamic movement over time in any musically meaningful way. A flat, lifeless mix and a punchy, dynamic mix can have identical average spectra. The analyzer is a microscope. It shows you a specific, narrow view of one dimension of your audio. A microscope is a great tool. But you wouldn't use a microscope to judge whether a painting is beautiful.

Producer takeaway

Use spectrum analyzers for confirmation, not exploration. Listen first. Decide what you want to change. Then check whether the data supports your decision. The measurement workflow again, because it's worth memorizing: `Listen → Hypothesis → Measure → Change → Level-match → Decide` If you find yourself staring at the analyzer more than listening to the music, close it. Your ears are the instrument. The FFT is just a flashlight.

FFT for Producers: How to Read a Spectrum Analyzer Without Losing Your Mind

Key Takeaways & Core Concepts