16 Jun 2023 - Meta introduces Voicebox speech model

Description:

Voicebox is the first model that can generalize, with state-of-the-art performance, to speech-generation tasks it was not specifically trained to accomplish. It can synthesize speech across six languages and perform noise removal, content editing, style conversion, and diverse sample generation. Voicebox outperforms the current state-of-the-art English model, VALL-E, on zero-shot text-to-speech in both intelligibility and audio similarity, while being as much as 20 times faster. It also achieves new state-of-the-art results on cross-lingual style transfer.

Added to timeline:

Date:

16 Jun 2023
