Jun 16, 2023 - Meta introduces Voicebox speech model

Description:

Voicebox is the first model that can generalize to speech-generation tasks it was not specifically trained to accomplish with state-of-the-art performance. It can synthesize speech across six languages, as well as perform noise removal, content editing, style conversion, and diverse sample generation. Voicebox outperforms the current state-of-the-art English model VALL-E on zero-shot text-to-speech in terms of both intelligibility and audio similarity, while being as much as 20 times faster. It also achieves new state-of-the-art results on cross-lingual style transfer.

Added to timeline:

AI Timeline: News and Events from 1950 - 2026 that have defined Artificial Intelligence

By Andrew Spoeth


Date:

Jun 16, 2023

