ChatGPT Creator OpenAI’s Voice Cloning Technology Is So Good That Even They Find It Too Scary for Public Release

OpenAI, a prominent artificial intelligence research organization, published a blog post on March 29, 2024, describing Voice Engine, a voice cloning model first developed in late 2022. Given only a 15-second audio sample and text input, the model can generate natural-sounding speech that closely resembles the original speaker. While the technology is impressive, OpenAI is cautious about a broader release because of the potential for misuse.

Voice Engine already powers the preset voices in OpenAI’s text-to-speech API, as well as the ChatGPT Voice and Read Aloud features. To better understand its real-world applications, OpenAI began privately testing the model with a select group of trusted partners in late 2023.

These collaborations have yielded interesting results, with companies like Age of Learning using Voice Engine for personalized educational content, HeyGen leveraging it for video translation, and Dimagi utilizing it to provide interactive feedback to community health workers. The technology has even been piloted in healthcare, with the Norman Prince Neurosciences Institute at Lifespan using it to restore the voices of patients with speech impairments.

However, OpenAI is well aware of the risks associated with generating speech that closely mimics people’s voices, particularly in an election year. To address these concerns, the company has implemented safety measures and usage policies for their partners, such as prohibiting impersonation without consent, requiring explicit permission from the original speaker, and using watermarking to trace the origin of generated audio.

As synthetic speech technology advances, OpenAI is advocating for proactive measures to ensure its responsible deployment. This includes phasing out voice-based authentication for sensitive information, educating the public on the capabilities and limitations of AI, and developing techniques to track the origin of audiovisual content.

In line with their commitment to AI safety, OpenAI has decided to preview Voice Engine but not release it widely at this time. By sharing these insights, the company aims to initiate a conversation about the future of synthetic voices and the necessary steps to harness their potential while mitigating the risks of misuse.

Here are a few reactions to OpenAI’s announcement:

Voice AI is by far the most dangerous modality.

Superhuman, persuasive voice is something we have minimal defences to.

Figuring out what to do about this should be one of our top priorities.

(We had sota models but didn’t release for this reason eg https://t.co/vjY99uCdTl) https://t.co/fKIZrVQCml

— Emad acc/acc (@EMostaque) March 29, 2024

If you haven’t disabled voice authentication for your bank account and had a conversation with your family about AI voice impersonation yet, now would be a good time. https://t.co/TkpdGUfr76

— Noam Brown (@polynoamial) March 29, 2024

OpenAI has had wild speech tech for a while now.

We’re still unsure whether/how we want to make them widely available ourselves (which ofc raises a bunch of issues), but it’s just a matter of time before someone does, and more should be done to prepare: https://t.co/8F2jTqbrLO

— Miles Brundage (@Miles_Brundage) March 29, 2024

