Make your noisy recording sound like pro audio with Adobe’s free AI tool

admin

December 20, 2022

0 Views 0

SaveSavedRemoved 0

An illustration of a microphone provided by Adobe. — Enlarge / Adobe’s Enhance Speech service can remove background noise from certain voice recordings.

Adobe

Recently, Adobe released a free AI-powered audio processing tool that can enhance some poor-quality voice recordings by removing background noise and making the voice sound stronger. When it works, the result sounds like a recording made in a professional sound booth with a high-quality microphone.

The new tool, called Enhance Speech, originated as part of an AI research project called Project Shasta. Recently, Adobe rebranded Project Shasta to Adobe Podcast.

Using Enhance Speech is free, but it requires creating an Adobe account and works best with a desktop web browser. Once registered, users can upload an MP3 or WAV file up to one hour long or 1GB in size. After several minutes, you can listen to the result in your browser or download the resulting cleaned-up audio.

In our tests with the service, Enhance Speech worked best with audio that contained a voice without crosstalk or excessive noise. For example, we recorded audio from an iMac’s built-in microphone of a person standing 10 feet away, including fan noise nearby, and the resulting audio (once processed by Enhance Speech) sounded like it had been recorded up close in a noise-free studio with a professional microphone.

Enlarge / Enhance Speech allows uploading MP3 or WAV files up to 1GB in size or one hour long.

Adobe

How does it work? Adobe did not provide any details, but we suspect that the company trained a deep learning model on many (possibly thousands) of hours of clean and noisy audio. The model could then “learn” to pick out the human voice frequencies and synthesize a facsimile that accurately matches the source. This is speculation until Adobe provides more technical details, and we have reached out to the company for comment.

On that count, some Hacker News commenters have reported hallucinated results—unexpected output like phantom voices where the AI misinterprets the input audio—from extremely noisy audio (such as speech recorded beside a waterfall) or from non-English language sources, which suggests that Enhance Speech is doing more than just a conventional noise reduction technique.

Enhance Speech isn’t the first tool to provide this kind of AI-powered noise reduction capability. An open source package called mayavoz and a commercial service called Audo Studio do something similar, for example.

It’s worth noting that Enhance Speech is part of a larger group of AI-powered podcasting tools from Adobe, including a Mic Check tool (currently available for free as well) and a transcript-based audio editing tool that is still undergoing an invitation-only beta test.

Source link

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Make your noisy recording sound like pro audio with Adobe’s free AI tool

Swatters used Ring cameras to livestream attacks, taunt police, prosecutors say

Power plant pollution higher in neighborhoods subject to racist redlining

Sedimentation threatens to steal capacity from nearly 50,000 dams

Apple previews a trio of apps that will finally replace iTunes for Windows

Vulnerability with 9.8 severity in Control Web Panel is under active exploit

This cool new approach to refrigeration could replace harmful chemicals

Leave a reply Cancel reply

Compare items

Shopping cart