Return to Veritone Voice
Veritone Voice is a hyper-realistic synthetic Voice as a Service (VaaS) solution that allows content creators and owners across industries to securely and ethically create, distribute, and monetize synthetic voices.
Veritone offers a custom synthetic voice cloning solution that allows users to securely create verified custom synthetic voices that can be created in many different languages.
In addition to custom voices, the Veritone Voice self-serve application enables users to create voice projects from a library of over 300+ stock voices and 70 premium voice-over artists across more than 150 languages.
As a complete end-to-end solution for synthetic voice, Veritone Voice gives users a complete suite of voice capabilities including voice creation, management, licensing, rights and clearances, workflows, and monetization.
Here are a few industries and divisions that can benefit from Veritone Voice:
For more industries, please visit Veritone Voice.
Veritone Voice allows content creators the ability to produce truly lifelike AI voice at unmatched speed and scale; create content on demand using text-to-speech or speech-to-speech input; reach new audiences in localized languages, in real-time, with branded voices.
Custom Voice Cloning:
Produce voice-over content without juggling schedules or paying for studio time. Clone voices including celebrities, sports announcers, and public figures—all you need is their consent. Create localized content on demand using text-to-speech or speech-to-speech input.
Enterprise Workflows:
Take advantage of Veritone’s proven AI expertise to optimize your voice automation output and succeed at scale. From enhancing metadata to generating dialogue, we use best-of-breed AI to deliver the best possible results from end to end.
API & Real-time voice
Extend the power of true-to-life, real-time AI voice across all your products and projects. With our world-class AI voice API, you can save valuable time and automate at scale by connecting Veritone Voice directly to any app.
Stock & Premium AI Voice
Start creating your own text-to-speech synthetic voice projects right away. Choose from more than 300 stock voices or 70 premium options for a voice your audience will recognize. Translate into over 150 languages and customize intonation, gender, dialect, and accent.
Veritone Voice supports both text-to-speech and speech-to-speech modalities giving clients the ability to create voices for all of their voice projects. With Veritone’s VaaS solutions, Veritone Voice offers a comprehensive suite of integrated voice features including voice creation, voice management, voice licensing with rights and clearances, voice workflows, and voice monetization.
Veritone Voice is built on Veritone’s proprietary enterprise AI platform, aiWARE. For an additional fee, users can leverage these cognitive engines, such as translation and transcription and combine them with advanced automated workflows to deliver transformed audio, at scale.
Ownership of one’s voice and protecting their IP is critical. We want to make sure that we not only help our clients generate licensing opportunities but also ensure they have the necessary support to navigate rights and clearances. This will ensure their name, image, and likeness are only being used by approved parties that maintain high standards.
Veritone Voice safeguards include regulated processes and checkpoints to ensure proper rights, clearances, and pricing are followed. Added IP protection includes inaudible watermarks and proprietary tools to help ensure content can only be accessed after permission is granted.
The voice creation process includes both written and verbal consent verification. Once created, the talent has the right to approve all synthetic recordings. All created recordings include an inaudible watermark that Veritone can verify.
All voice training data and voice models are stored in a highly secure, proprietary digital asset management platform, ensuring the protection of your data.
Only authorized users will have access to create new clips, and all clip creation is tracked at the user level. The voice model code only works on Veritone systems and cannot be deployed anywhere else.
If at any time, the voice owner would like their voice clone deprecated, Veritone will destroy the voice model.
For Veritone Voice clients, synthetic voice is a powerful tool that can be used at the complete control of the voice owners. Some clients may use synthetic voice for localization or limited to production editing, but Veritone Voice can also be used for complete end-to-end production. The voice owner has full control, who knows their audience best.
As a best practice, we recommend adding disclaimers so the audience is fully aware that they are hearing a synthetic voice.
Veritone’s VaaS solutions, Veritone Voice offers a comprehensive suite of integrated voice features including voice creation, voice management, voice licensing with rights and clearances, voice workflows, and voice monetization.
Text-to-speech (TTS) is the process of producing synthetic speech from a text file.
Speech-to-speech (STS) is the process of producing synthetic speech from an audio file.
Veritone Voice offers a rich marketplace of over 300 stock voices that is immediately available to customers. You may choose voices from a broad and diverse marketplace of genders, over 150+ languages, numerous accents, and stylize each voice so that it suits your needs. Additionally, select over 70 recognizable voice-artist approved AI voices, available to license at additional cost.
Custom voice creation is supported by our managed services team. To start, the voice talent or individual whose voice will be recorded and used to create a custom voice model must explicitly consent (verbal and written) to the creation of their voice model. If the voice talent is deceased, the estate as well as the IP owner if not the estate must provide explicit consent.
Veritone Voice currently has access to market-leading voice engines that’s growing daily. A member from our managed services team will assist with the proper identification of these models based on use cases.
All voice training data and voice models are stored in a highly secure, proprietary digital asset management platform, ensuring the protection of your data.
Veritone Voice is mobile-responsive and built for any browser on desktop and mobile.
At this time, Veritone Voice does not have a mobile app.
Custom Voices
Starts at $9K/ per voice USD
Contact us to get started
Enterprise Workflows
Contact us for details
Stock & Premium Voices
Starts at $500/mo USD
Contact us to get started
API & Real Time Voice
Contact us for details
Our team of experts works closely with you and your team to thoroughly define a master services agreement or platform licensing is determined.
The VaaS solution includes such features as inaudible watermarks, the automated inclusion of a copyright tone; traceability, the ability to track the components used to replicate your voice clips; licensing protocols, regulated process and checkpoints to ensure proper rights, clearances, and pricing are followed.
No. Veritone has built-in licensing protocols to ensure custom voices are only being used by approved parties that maintain high standards.
For custom voice models, Veritone manages the model creation from end-to-end along with the production of audio files that use the model. All requests for synthetic content creation will come into the experienced Veritone Voice managed services team and only be produced with prior audio and written approval from the voice owner.
Your voice is made into a code, and that code only works on Veritone systems. If you decide to stop using it, we destroy the code of your voice and provide receipt of destruction. It will no longer exist on our servers or be available anywhere, it will be deleted.
Working with the Open Voice Network, IAB, and other governing bodies Veritone will adhere to best practices to protect consumers, and IP (voice) owners.
Depending on the application of synthetic content, the listener may or may not know it’s synthetic. For example, a celebrity authorizing the use of their voice model to fix a bit of audio in a movie, or if they use their voice on content in a foreign language rebroadcast with localized translations, the audio file might go without official notice.
It is a best practice to offer a disclaimer for consumers when synthetic voice is used for net new content particularly if a deceased voice is generated.
Consumer disclosures, in audio and/or visual, may be required when the voice model is being licensed and used for a paid endorsement or for government officials making public statements.
Veritone upholds a promise for good and is committed to working to address public concern and protect the intellectual property of the voice talent and advertising community. We will publish industry best practices and governance for synthetic content usage in public or commercial channels. In addition, Veritone is an active member of the IAB, the Open Voice Network, and other governing bodies as part of our efforts to develop global best practices for synthetic content.