This ElevenLabs AI tool can create a unique voice based on your X profile
ElevenLabs, a New York-based artificial intelligence (AI) company, has released an application programming interface (API) for its Voice Design feature, which recently debuted. The announcement came last week, and in addition, the company also introduced an open-source project called X to Voice, which can generate a unique voice for an X Profile (formerly known as Twitter) based on the user’s messages. The feature also shows a text prompt that is automatically generated based on the analysis of the profile.
In one blog postElevenLabs has detailed the two new AI tools. The first is an API version of the Voice Design tool, which was recently introduced. Voice Design is a new capability developed by the company that can generate unique AI voices based on text prompts. These voices are based on the description shared by the user, including pitch, timbre, delivery rate, intonation, and more.
This feature is now made available through the company’s API. This means that developers can use this capability to build apps and software. Voice Design can be offered by developers to develop voices for their AI characters, or to users so they can generate new voices for themselves.
The company has offered two endpoints. First allows developers to generate three unique spoken examples from a text prompt. The second allows them to save the spoken samples to their library for local use. ElevenLabs did not highlight the price of the API or the cost per request of the AI model. Details about the AI model are also not known.
The second tool is the company’s open source project called X to Voice. It is an extension of the feature available for testing on a web client here. Users can add an X username and the AI will automatically analyze the profile, including the bio and messages. Once analyzed, it generates a text prompt based on the analysis.
The text prompt is then automatically sent to Voice Design to generate a unique voice for the profile. Gadgets 360 tested the feature and found that it takes between 30 seconds and a minute to generate spoken samples for a profile. A total of three voice samples are generated. The AI voice pronounces a sentence that is partly based on the analysis of the profile.
In addition to the three voice samples, the page also shows the text prompt used to generate the AI voice. We also found that the feature animates the profile photos of users who have added a close-up of their face and syncs lip and mouth movements to match the words being spoken.
For the latest tech news and reviews, follow Gadgets 360 X, Facebook, WhatsApp, Wires And Google News. For the latest videos on gadgets and technology, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who is that360 on Instagram And YouTube.
Realme GT 7 Pro with Snapdragon 8 Elite SoC, 6,500 mAh battery launched: price, specifications