Perception is not reality

  • In my previous article, I made the case that all recorded music has a certain level of intrinsic production value applied to it, and that value has much more of an impact on the music’s ultimate fidelity than the sample-rate or codec in which it is delivered; two facets of digital music that typically dominate a large portion of current audiophile discourse.

    Most popular music made today has a significantly lower overall production value than that made in the past. This is largely due to the overzealous use of dynamic range compression (DRC) coupled with brickwall limiting. These techniques are employed by recording engineers at the behest of producers, labels, and even the artists themselves, to make the music sound louder. This race to the bottom is aptly named the “Loudness War” and it has been going on since the dawn of recorded music.

    The idea is that if you, the listener, hear two pieces of music back-to-back, you will always gravitate toward the louder one over the quieter one. In fact, louder sounding music initially sounds better to our ears when in reality it’s actually quite the opposite. Ergo, when volume differences present, perception is not always reality. To be a truly critical listener, you have to first check yourself (and your volume control) before you wreck yourself.


    After tens of thousands of years of evolution, the human ear has developed into a truly sensitive listening device, capable of hearing six orders of magnitude in dynamic range (140dB), three orders of magnitude in frequency (20Hz – 20kHz), and twelve orders of magnitude in sound intensity.

    And ideally you were born with two of them, allowing you to enjoy the natural world in 3-D stereo which improves your ability to localise sounds substantially over good’ole mono. Put simply, your ears allow you to eavesdrop on the softest whisper in a crowded room as well as rock out to the loudest scream at a Deathspell Omega concert. However, your hearing is not perfect, and in fact a lot of information can get lost in translation between the point in which air enters your pinna (outer ear), strikes your tymphanium (eardrum/middle ear), and is eventually converted to neural signals through the cochlea (inner ear). For example, if two frequencies close to each other are played simultaneously, there is a very good chance our ears will only detect one of them. This phenomenon is called critical masking and has to do with the fact that despite having wondrous tech specs, the inner workings of our ears still have a finite bandwidth in which they operate. Given the right auditory stimuli our perception of sound can easily be fooled. Same is true for volume.

    Volume is not linear with respect to frequency. In other words, the apparent loudness of a sound depends on its frequency and intensity. This relationship between frequency, intensity, and loudness was first studied in 1933 by Harvey Fletcher and Wilden A. Munson, who published their initial findings in their now seminal paper “Loudness, its definition, measurement and calculation.”

    By 1937, the duo had developed the first equal-loudness contour known as the Fletcher-Munson curve. It was eventually refined in 1956 and became the foundation of the official ISO 226 standard, now referred to as the Equal-Loudness contour. Loudness is measured in Phons – the official unit of measurement for volume and represents the dB SPL of a sound at a frequency of 1kHz that sounds just as loud. Our perception of volume is not a straight line across the audible frequency spectrum.


    Imagine listening to Bolt Thrower at 80dB. Your head is moving back and forth as this Brit-tank steam rolls your eardrums. Life is good. Then you turn the volume knob down and play the same song at 10dB. Take a look at the frequency disparity between the two. At 10dB, anything below 500Hz is pretty much a wash, i.e. below the threshold of audibility. Note how bass (less than 250Hz) and treble (greater than 2kHz) dramatically change in general depending on the number of phons. This graph also explains why historically speaking, most engineers tend to stay around 85dB when they mix, since that’s where the Equal-Loudness contour tends to be the flattest. It’s also why at 80dB, Bolt Thrower sounds richer and fuller than it does at 10dB since the volume across the audible frequency spectrum is more consistent.

    At this point you might be asking yourself why would I take a stand against the Loudness War? Doesn’t DRC and brickwall limiting allow engineers to make the music sound louder and therefore better? Nope.

    Between manipulating the recorded volume and adjusting playback volume there are differences. When a mastering engineer artificially pushes the volume higher by applying massive amounts of DRC, he or she is changing the recorded volume by squashing the high and low ends of the frequency spectrum. Take a look at the two masters from the track “Astrid Falls” from the last Witherscape record.


    Note: the vinyl master waveform is not from a needle drop but is sourced from the actual official digital master that was eventually pressed to wax (thanks Dan!). See all those peeks and valleys on the bottom waveform (the vinyl master)? See how they have been literally chopped off in order to raise the volume of the mid-section of the top waveform (CD master)? By simply mastering this record for maximum volume on CD its transients and imaging were substantially degraded, making the music sound lifeless and dull. More simply put, hyper compression sacrifices fidelity at the expense of volume and irrevocably damages the original recording. Read my “Take the Swanö Challenge” article for a complete lowdown (even better, take the challenge yourself).

    Now that we have a better understanding on how volume can change the way we perceive music, and how artificially raising it too high during the production process can damage it, then it should come as less of a surprise that level matching volume levels is absolutely essential when engaged in critical listening, especially when A/B-ing gear.  Otherwise, comparisons don’t enjoy a level playing field.


    How often have you stumbled across a review that claims some hot new remaster of an album you are intimately familiar with is punchier and livelier than the original (coincidentally, the new packaging and price tag for that remaster is also punchier and livelier as well)? And in that review, how often does the author explicitly state that he or she level matched the original master with the new one during their listening session? Exactly. How do you know the reviewer’s conclusion wasn’t  a result of volume differences? You don’t.

    An instant change in volume between the original and the (newly crushed) remaster can make the latter appear, at least initially, richer and fuller than the original. But once level-matched, a very different and sonically sadder story is told. The Loudness War strikes again!

    You can read more of Alex’s writings over at his Metal-Fi webzine.

    Written by Alex M-Fi

    Alex is co-founder and Chief Editor of, a website dedicated to the head-banging audiophile. His blood type is Type O Negative. Alex derives his income from writing software.

    ELAC tease Roon-enabled amplifier, speakers at CES 2016

    CES 2016: One Audio’s high-end hifi-system-in-a-box