A recent release of Zoom has brought, among other things, “High Fidelity Audio Mode” (High fidelity music mode in the application settings) to the Windows and Mac clients. The feature was announced in August and rolled out publicly on September 1, 2020 with the 5.2.2 update.
Musicians who have been using Zoom and other videoconferencing platforms for sessions, rehearsals, and performances since the coronavirus pandemic began earlier this year have struggled against an audio system that is optimized for spoken word. The new mode improves audio quality in ways that make the platform work much better for musicians.
The new high fidelity music mode can be enabled in Zoom Settings > Audio > Advanced > Show in-meeting option to “Enable Original Sound” from microphone > High fidelity music mode.
Turning on this setting will disable Zoom’s aggressive dynamic compression, which eliminates all but the loudest one or two sources in a meeting — useful for discussions, annoying for choral singing. It also allows Zoom to use a higher quality data compression than with the setting off, though this still caps out at a 192kbs for a stereo feed.
In a practical sense, this will mean that if you’re discussing a project with collaborators or in a remote recording session, routing audio from your scoring or audio application into Zoom (see our post on how to set this up in Sibelius, Finale, Dorico, or Musescore), listeners on the other end of the conference will hear the difference between softs and louds, and the subtleties of articulation and orchestration will be rendered more faithfully. Teachers will be able to more easily talk alongside playing or audio playback from a student without Zoom squashing either the talking or the music. Even something as simple as playing back recorded music examples without resorting to an external service will be greatly improved using the new audio setting.
While you’re tweaking these settings, you might also want to turn off Automatically adjust microphone volume and turn on Disable echo cancellation, both of which might also cause issues for musicians.
In their initial blog post in August announcing the new music mode, Zoom noted that “This mode will require a professional audio interface, microphone, and headphones to allow you to offer high-quality private lessons.” In my initial tests, this didn’t seem to be a requirement in turning on the feature (technologically, I’m not sure there’s a way for Zoom to know the properties of my microphone and audio interface).
However, I did notice that any imperfections in the setup, like background noise or hums, became much more apparent. And my interpretation of Zoom’s caveat above is not that you can’t use high fidelity music mode without a few hundred dollars worth of new gear. You can turn it on and hear some notable improvements, but you’ll also hear limitations. So to get the most out of the new setting, it would be better to have something that’s a step up from the built-in mic on your laptop. And always, always use wired headphones and when possible, a wired network connection.
Finally, it is worth heeding the warning that enabling high fidelity audio “can increase CPU utilization and consume greater network bandwidth,” and that for “best results, an ethernet connection (not wifi) is strongly recommended.” If you’re running Zoom to broadcast, say, Logic Pro and Sibelius all loaded up with fancy sample libraries, you may need to make choices about what to sacrifice if your computer or connection start to choke.
Considering the rapidly evolving landscape of real-time audio collaboration tools that musicians and music teachers are swimming in at the start of the school year (in the northern hemisphere), I think the new Zoom audio features are a huge step forward in quality and simplicity, if you have the gear and tech specs to support them.
I have spent a fair amount of time working with Cleanfeed over the last couple of months. Cleanfeed is a web-based, realtime, remote audio collaboration platform that runs in a browser and can work for multi-point sessions. It is a bit tedious to set up larger sessions with many users. And since it doesn’t handle video at all, I was only able to use it alongside Zoom, and never instead of Zoom. That complexity was worth the increase in quality for my recent experiments with writing music for remote chamber ensemble, Music for Social Distancing (my paper about the work).
While Zoom’s new audio quality isn’t quite as high as Cleanfeed, the added convenience and simplicity more than makes up for what small quality differences exist. It’s not a perfect A-to-B comparison since they’re using different compression algorithms, so some things might sound better on one than another. And depending on your setup and the setups of your collaborators or students, I’m not convinced everyone would notice a difference at all based on my brief testing.
It’s also worth noting that this update is focused on fidelity, not latency. You will won’t be perfecting that intricate rhythmic play in Ligeti’s Six Bagatelles or Ravel’s String Quartet in F in your remote chamber ensemble rehearsals, at least not over Zoom.
The update is available on Windows and macOS (not mobile) from the Zoom downloads page or by checking for updates within the application.
Originally published at Scoring Notes, 9/2/2020. Even more content is available in a podcast David published on this very topic.
8 thoughts on “Better Music Experiences Come To Zoom With High Fidelity Audio”
hmm, has anyone actually noticed a difference? I see 48k going out, but it still sounds like a clock radio on the far end.
This goes for apps like spotify, routing audio from my interface directly to Zoom and using original audio, as well as attempting to share “computer audio” and using a third party app to route the audio to the Zoom Audio Device.
If i route the sound direct to my sound output, it sounds fine and not at a low bitrate…
I think it’s been 48k this whole time. The sample rate isn’t going to have a huge impact in the perceived audio quality at this level. The difference is in bit rate (which is unrelated to sample rate). The quality of the audio improves quite a bit with this update, but the biggest changes are around the new feature removing dynamic compression (not to be confused with data compression) which prevented more than one person from being heard simultaneously. Also remember that you need to toggle “Original sound” for each new Zoom call, or at least I do. Going into settings just adds the toggle; you still need to turn it on for each new session.
Interestingly enough, I just isolated my issue to a mac vs PC issue… my mac sounds tinny and hollow… my PC has full quality sound output to zoom. Go figure. Think something is broken with one of the latest updates, or perhaps my own system.
That is pretty weird. I’m also on a Mac and haven’t seen anything like you’re describing.
I agree with you on the vagueness of Zoom’s requirements, specifically the “professional audio interface. ” My suspicion is that “professional” translates to “audio input must support 48 khz sample size”.
In my experience (mostly USB mics and Apple products) the DAC is built-in/integrated with the microphone (including recent webcams, including apple monitors and macbook pros, probably other models).
My guess based on that is a lot of people using a either a USB mic or a Mac would not necessarily realize they already have a qualifying audio interface (48 khz is only arguably a ‘professional’ sample rate and it’s available on a lot of hardware as far as I can tell).
Mac users can run Audio Midi setup (Applications > Utilities) to check their audio inputs for supported sample rates (recent apple products seem to support 44.1 khz and 48khz).
Here’s an article on how to set the input to 48 khz on various the devices: https://www.provideocoalition.com/48-khz-how-to-set-it-in-android-ios-macos-and-windows/
Pretty much any microphone or audio interface can be set to either, and in most cases the software can force the hardware/firmware to make the change.
48khz sample rate isn’t really “professional”, it’s just the sample rate for an audio track that will be synchronized to video. 48k is used because it (unlike 44.1k) is evenly divisible by all the common video frame rates (24 and 30 in the US, 25 in the rest of the world, and 60 for Peter Jackson). I would bet that Zoom is doing 48k audio for everything regardless of what settings are selected.
The only thing that might cause an application like Zoom to be unsuccessful in setting a new sample rate is if there is another app that is running and requiring 44.1k, like an app that might be recording into a 44.1k project. (Changing sample rates after recording is not ideal!)
Thank you for decoding the messaging around this update. I am a music teacher. I don’t currently have a DAW and my students most definitely don’t. Should I still advise them to opt for high fidelity mode? Your points about ethernet and wired headphones are well taken. Thanks again for your help navigating this new reality.
I would definitely try turning it on on both ends. Contrary to some of the statements from Zoom, you can still get a lot out of the new setting without having any fancy gear.