Tech

OpenAI’s Voice Engine: Revolutionary Voice Cloning Technology Raises Both Excitement and Concerns

Published

March 31, 2024


Voice cloning has advanced remarkably in recent years, with the time required to clone someone’s voice shrinking from minutes to mere seconds. OpenAI, a Microsoft-backed company, has been developing its own voice-cloning technology, named Voice Engine, since late 2022. Voice Engine requires a minimum of 15 seconds of spoken material to reproduce someone’s voice, and users can then input text to create “emotive and realistic” speech that closely resembles the original speaker. Because the resulting speech is almost indistinguishable from the original, the technology raises concerns about potential misuse.

One concern is that criminals could clone a person’s voice and then call their friends or relatives, tricking them into sending cash via bank transfer. Another is how the technology might be used in the upcoming presidential election, as illustrated by a recent robocall that used a clone of President Joe Biden’s voice. Voice actors, meanwhile, fear for their livelihoods: they worry they will be asked to sign over the rights to their voice so that AI can create a synthetic version, with compensation likely to be much lower than if the actor were asked to perform the job in person.

Despite these concerns, OpenAI presents a more optimistic view of the technology’s potential applications. For instance, Voice Engine could provide reading assistance to non-readers and children using natural-sounding, emotive voices that represent a wider range of speakers than preset voices allow. Instant translation of videos and podcasts could also become possible, and Spotify is already trialing such translation. Voice Engine could also help patients who are gradually losing their voice due to illness to continue communicating in what sounds like their own voice.

Sam Altman, CEO of OpenAI.

The technology has the potential to greatly improve communication and accessibility for individuals with disabilities. OpenAI has shared examples of AI-generated audio and reference audio on its website, showcasing the remarkable capabilities of Voice Engine. The company is, however, taking a cautious and informed approach to the broader release of this technology, aware of the potential risks and consequences. OpenAI has launched a dialogue on the responsible deployment of synthetic voices and how society can adapt to these new capabilities.

As Voice Engine develops, OpenAI and other companies working on similar technologies will need to prioritize responsible deployment and address the concerns about misuse. Doing so would allow the technology to improve lives and make communication more effective while mitigating the risks of its use.
