Audio Investigations: DSD loses information, but what is lost? [extended June 14]

Wednesday, June 4, 2014

DSD loses information, but what is lost? [extended June 14]

It's easy to see that sigma delta systems and/or DSD lose information. Consider "canonical DSD" as a purely 1-bit system (which is not actually how the encoders and decoders work, but still describes the datastream well).

At Fs, the standard sampling rate frequency (44.1kHz I believe, or perhaps it's 48kHz), a 16 bit system has 65536 "states." A 1 bit system operating at 64Fs has 2 states per bit times 64 or 128 states--thus it has 65536 / 128 or 1/512 as many possible states. Now that is huge information loss! (Quad DSD has 1/128 as many states as 16 bit digital. The new PS Audio DAC has 10x the DSD rate or 1/51 as many states as 16 bit digital.)

But what information is lost? DSD with noise shifting has standard specifications superior to 16 bit digital in almost every way. It has higher frequency response, lower midband noise, etc. So how can information be lost? And given this huge information loss, how come so few audiophiles notice it and so many audiophiles strongly believe, nevertheless, that DSD is a superior system to 16 bit PCM--and even high resolution PCM (which has far more states than 16 bit digital)? Surely DSD is not a "lossy compression" system like MP3, which has issues which are far clearer on a technical or listening basis. Strangely, DSD is a "lossy expansion" system which uses more bits to encode less information about amplitude, and it's very hard to see what is actually being lost.

The reason the limitations of DSD systems go unnoticed is that human hearing and most technical measurements don't actually deal with small changes in amplitude that do not cause detectable differences in continuous waveforms. At best, humans can only reliably hear differences in overall amplitude down to 0.1dB in the midband, and they are far less sensitive to differences in overall amplitude very high frequencies close to the bandwidth limit of digital systems.

The 128 states that canonical DSD has at Fs represent a dynamic range of about 44dB. 16 bit digital systems have the same dynamic range across the audible spectrum--98dB.

Clearly what is lost with sigma delta systems is dynamic range at the higher frequencies down to about the middle of the frequency spectrum. This is not easy to hear, but many things audiophiles obsess about are not actually that easy to hear either. In a complex recording, high frequency dynamic range is what makes it easier to separate different sounds.

In a photograph, the smallest details represent something called "resolution." That is analogous to what is lost with sigma delta systems, which should also be called resolution.

June 14 further thoughts:

One of the problems with observing this difference in information is that it may not be audible over one listening. There is only so much information a human listener gathers in each listening. Subsequent listenings may gather a slightly disjoint set of information. Possibly never is the sum of all information completed, except for standard recordings.

But clearly the larger the initial amount of information, the more likely variance between the information gathered in the first listening and each subsequent one. This difference makes a recording sound "real" as opposed to "canned." A canned sound may be fine the first time, but then it gets boring, precisely because there is no more information to be gathered, it was all apparent the first time.

So we definitely want a recording to have as much information as possible, particularly in audible ranges, and extending to lesser degrees beyond.

By stepping back from the information available even with far earlier PCM systems, DSD deprives us of information, as of course to lesser degrees generally all sigma delta devices (I' do calculations in a later essay).

No comments:

Post a Comment

Living Room System

2-way with Acoustat 2+2 from 125Hz to 17k, and a pair of SVS Ultra PB 13 below 125Hz (linear phase LR8 crossover). Now using the best from 2 pairs of 2+2's, one white and one black, EQ'd to be the same. Acoustats powered by Aragon 8008 BB. All amps including Subs and Tweeters driven by Emotiva Stealth DC-1 DACs. DACs for each way driven by AES from Behringer 2496 DEQs (EQ and limiting), driven by AES from MiniDSP OpenDRC (linear phase FIR crossovers or corrections generated by RePhase). Driven by AES from a Henry Engineering AES 4-way splitter. Driven by AES from a Tact RCS 2.0 Digital Preamp with Room Correction and PEQ bypassed. The Tact receives most sources as digital. Analog sources are converted to digital at 24/96 by Lavry AD10 analog to digital converter on the side wall and sent via a 12 foot AES cable to the Tact. Emotiva XSP-1 serves as phono preamp and analog selector and level adjuster feeding the Lavry. Oppo UDP 205 plays most discs, and does streaming from my library and Qobuz using Roon, through digital coax output to the Tact, but in other cases through HDMI and a Kanex Pro De-Embedder, and in the rare remaining cases (used to be more) where neither digital transfer method retains full resolution, analog through balanced cables to the Emotiva and Lavry. Denon DVD-9000 plays HDCDs better than anything I know of and connects to Emotiva and thence Lavry for conversion to 24/96. A Sonos Connect is an alternate Tact source, and destination for the Pioneer F-26 FM tuner, which also plays in other rooms through Sonos. A Dynavector 17DIII is mounted to the hockey-tape dampened Ittok tonearm on a Linn Sondek LP12 with Lingo external power supply, using a Michell plastic low spindle clamp is with a trimmed felt washer. There are seperate AC circuits for Subwoofers, Acoustat Poweramps, and Rest of System. The Rest of System circuit is conditioned and regulated by a Panamax MB 1500 and served via a variety of audiophile grade power strips. Single ended digital cabling by Blue Jeans Cable using Belden coax, and Blue Jeans LC-1 audio cabling. AES by Canare, Mogami, Gotham City, and Geistnote/Canare. PTFE balanced audio cable by Surf Cables. Speaker wire mostly hand twisted milspec PTFE coated 12g silver plated copper. Playlists generated by mplay version 0.69 soon to be released.