Discover VoiceCraft, the latest advancement in AI-powered text-to-speech and audio editing.
VoiceCraft is a token infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS). It excels with ‚in-the-wild‘ data, including audiobooks, internet videos, and podcasts.
Notably, VoiceCraft can clone an unseen voice or edit a recording using only a few seconds of the target voice.
Learn more about VoiceCraft: