Speaker DSP

Speaker DSP and room correction is a hotly debated topic. When not done properly it can certainly do more harm than good but I believe that when done correctly it will always be better than not having it in the system. In order to achieve the highest quality systems, DSP should be used in a properly treated room. Having said that, fully implementing a DSPed system is a significant expense compared to a traditional system.

In my system I use DEQX units to handle this. There are a couple of other options on the market but none which are as extensive or with a fully integrated setup process. Some alternatives include the Acourate programs, some of the miniDSP devices, and Trinnov which provides more limited capabilities. Despite which system you use, implementing it correctly is critical and should be done by an expert. Recently I’ve been handling system calibrations on my own but only after having seen the process done by Larry Owens and Ken Goerres a number of times.

The DSP in my system targets 4 different areas. The first is the room correction in the lower end of the spectrum. Most people will be familiar with this type of correction. Because of minimum phase behavior below the Schroeder transition area, a simple EQ can correct this region. Of course an analog EQ would cause various degradations to the system so ideally it should be done with DSP before any stage of conversion. One issue which people normally run into is trying to overcorrect their low end which results in more harm than good. This form of room correction must be used in combination with acoustic treatment. Peaks can be brought down by a considerable amount but dips should not be brought up by any large amount using DSP due to the fact that any additional energy at those frequencies will cancel itself out. Even rooms with large amounts of acoustic treatment will have variations in the low end. The main purpose of the acoustic treatment is to bring up the dips as well as to control the decay time. Once this has been achieved, the DSP handles the last bit of flattening the response curve.

The next area which the DSP in my system treats is the higher frequency correction of my speakers. This is largely a correction of the driver response on its own but factors some room anomalies in as well. Many correction systems try to correct the entire response (speaker + room) which results in many issues as a simple response curve does not accurately represent everything that occurs over several hundred milliseconds. The room treatment should be responsible for what happens to the sound beyond the direct sound from the speakers and most attempts to correct this digitally result in strange behavior. Having good off-axis behavior in a speaker is critical for this reason.

Correcting the response of the drivers in the speaker on their own affords many benefits which most speakers on the market do not have. Most speaker designers must aim to create a system that is relatively flat based on the response curves of the drivers they use. This limits their choices quite a bit and they must often sacrifice other aspects of the speaker’s performance. By using DSP correction, the response curve of each driver is fairly trivial which allows the drivers to be selected based on other characteristics. This includes distortion specs, dispersion characteristics. and any other anomalies. By prioritizing these aspects over a naturally flat response curve, an incredibly detailed speaker with extremely low distortion can be created. Control over the dispersion characteristics also creates many new possibilities such as the crossfire horns in my speakers which result in a sweetspot that’s wider than anything else around. More on that in another post.

This type of correction requires filtering beyond what typical IIR filters provide. The DEQX uses a combination of FIR and IIR filters to accomplish this. This allows for the phase and timing domain to be controlled separately from the frequency domain to correct timing anomalies in each driver.

The next aspect which the DEQX controls in my system are the crossovers. By using DSP crossovers, there is no filtering that happens after the conversion stage. Neither active nor passive crossovers. This eliminates the distortions caused by those circuits. Very few designers can create excellent passive crossovers and even then, they cause amplifiers to behave oddly. In most cases active crossovers will perform better than passive but also introduce distortions of their own. Both active and passive crossovers have severe limitations on the filter shapes which they can produce. On top of this, most designers will apply basic filter shapes without taking into account the response curves of the speaker drivers.

By using FIR filters, the DEQX is able to create perfect linear phase crossovers ranging from 48dB/octave to 300dB/octave. This results in no phase distortion which is not possible in other crossovers. The benefit of using steep crossovers (which also isn’t possible in other systems) is that there is little overlap between speaker drivers so the imaging is incredibly precise. The resulting crossovers are completely invisible.

The last area which DSP contributes to my system is in defining the low end of my speakers. Without DSP, my speakers would have a very strange frequency response which is by design. The tuning frequency is below where the drivers naturally roll off. By using DSP, the drivers are extended lower where they meet the tuning frequency of the cabinet. This wizardry adds roughly 10Hz getting the speakers down to 20Hz.

One last point to note is that any ported system should use a high pass filter below the tuning frequency. Without it, those frequencies will cause severe distortion without producing any output from the speakers. By adding a high pass filter, you’ll end up with a cleaner and tighter low end response which appears to extend lower. This is not possible with passive filters so must be done at least with an active filter but ideally with DSP.

The only real drawback to DSP when done correctly is the added expense and system complication. In my particular case, my left and right speakers alone require 6 channels of DACs and amps. The DSP does add latency, around 24ms in my case, so it’s not usable for most tracking but it’s not an issue for mixing and mastering where some plugins add several times that amount of latency.

If you have any questions, want to implement some DSP in your system, or still think that no DSP is better, get in touch.

Intro to My System

In this post I’ll try to briefly explain the basics of how my system is set up and will dive deeper into certain aspects in later posts. Don’t be discouraged by any aspects you don’t understand and fee free to reach out to me with any questions about my system or implementing similar concepts into your own.

One of the things I strive for in my system is a modularity which allows me to alter my setup for any projects I have. While lately I’ve mainly done mastering, I often work on projects involving recording, mixing for singles/albums, score mixing, and orchestration.

Although I work entirely in the box, my setup is quite complex in order to achieve the quality of monitoring that I have with what I call “fully digital monitoring.” Everything starts at my PC which I built specifically to handle mastering projects at higher sample rates. For mixing and orchestration work, my previous PC was more than adequate but it suffered to run plugins such as ones from Acustica in series at 96kHz and 192kHz mastering situations.

The PC is connected to an RME Digiface USB. The RME TotalMix is fundamental to my workflow involving complex routing, presets, and monitor level control. I required an interface with at least 4 separate stereo digital outs plus ADAT for higher track counts when needed and this interface was the only one on the market that could handle this (unless I went with MADI plus a separate unit to break that up into separate digital outputs). Although Toslink has inferior jitter figures, the galvanic isolation is great as my audio system is completely separate from my PC at an electrical level. The reclocking done by the rest of my gear in combination with using glass fiber cables (as opposed to the regular plastic Toslink cables) means that this surpasses what most would achieve with a coaxial system.

The first output from my interface goes to my main stereo speakers. The Toslink first feeds into a DEQX. This handles correction filters for each driver, linear phase crossovers, and room EQ using a combination of FIR and IIR filters. The converters on this particular DEQX have been modified by Larry Owens to sound outstanding. The DEQX has a separate output for each driver meaning that it has 6 DAC channels. The high channels go to a THX 789 amp while the mid and low outputs feed a custom Hypex amp I built. These 6 channels of amplification feed their respective drivers on my left and right speakers. That’s a compression driver in a custom horn, a 12” mid, and a 15” sub. These speakers can be thought of as 2-way plus subs rather than 3-way as they aren’t set up as most 3-way speakers on the market. The result is astonishing sound quality that extends below 20Hz. I will go into much more detail on the DEQX and my speakers in later posts.

The second output from my interface goes to a second DEQX which gets used for my center speaker and LFE subwoofer. The center speaker is biamped with a Hypex module and the LFE sub is powered by a modified Behringer NX3000 amp which runs silently. While I don’t typically use Behringer gear in a high end system, this is one of the few amps on the market offering more than 3000W at less than $5000 and which can be run without absurdly loud fans. Considering that it’s only for LFE use, I can live with a less than fantastic amp.

The third output goes to my surround speakers which are the JBL 705p. These have their own EQ and time correction built in. The entire 5.1 system is time aligned to well within 1ms. All in all, there’s about 5750W of power for all of the speakers. While I have used class A and class AB amps in the past, nowadays it’s all class D other than the THX amp.

The last output from the interface goes to an RME ADI-2 DAC for use with my headphones. The only time I work on headphones other than while travelling is for QC work when mastering. This last output also gets used for any other speakers I want to set up in the room such as breaking up the ADAT into multiple channels via another unit in order to work in 7.1.2 for Dolby Atmos projects but that gear is normally not in the room.

Beyond the monitoring system mentioned, the only other relevant piece of gear is my Add-Powr Sorcer 4x which works in mysterious was to improve all of the gear in my system. More detail in a later post…

The concept of fully digital monitoring comes from the fact that in my system, all crossovers are DSP so the only stage of conversion happens after the crossover for each driver. The DACs feed straight into the amps which feed straight into each driver. There are no active or passive crossovers, extra stages of conversion, or anything else in the signal path to futz with the sound quality. It doesn’t get any better than this.

In terms of the modularity to the system, there are a couple of things I bring in and out of the room to accommodate different projects. As mentioned already, additional speakers can be brought in for working in formats past 5.1. When working on orchestration projects I’ll typically bring in a midi controller keyboard to sit under my desk cart as well as other control surfaces. For recording, a separate rig is used which plugs in via ADAT and is normally sufficient as I rarely record anything more than the occasional singer. Having said that, I have plans of installing a more permanent recording rig into my main system consisting of an AEA TRP2, Gyratec tube preamp, and RME ADI-2 FS.

As I mainly deal with mastering, I value having an incredibly low noise floor in my room. In order to achieve this, measures include being in an underground room built with isolation in mind, dead quite fans in my computer (the motherboard itself buzzes louder than the fans…), no mechanical hard drives, and pads built into cables and amps to optimize the hiss in my speakers while maintaining my calibrated working levels. In addition to this, a dedicated power line isolates from the noise generated by the lights in the room.

There’s no studio I’d rather be in.

I Know Everything...

… actually I don’t. Take what I say with a grain of salt. What I’ll be sharing is mainly based on my own empirical observations and research. To your own research and most importantly, do you’re own listening!

New posts to follow.