VoiceCraft
Visit ToolVoiceCraft is an Audio & Music tool that enables zero-shot speech editing and text-to-speech synthesis. It allows users to edit and generate speech from text using a few seconds of reference audio.
At a glance
Trending
VoiceCraft is an Audio & Music tool that enables zero-shot speech editing and text-to-speech synthesis. It allows users to edit and generate speech from text using a few seconds of reference audio.
Trending
About
VoiceCraft is an advanced open-source tool designed for zero-shot speech editing and text-to-speech (TTS) generation. It leverages a token infilling neural codec language model to achieve state-of-the-art performance on diverse, real-world audio data, including audiobooks, internet videos, and podcasts. Users can clone or edit an unseen voice with just a few seconds of reference audio. The tool offers flexible inference options, including Google Colab, Docker, and standalone command-line scripts, making it accessible for various technical skill levels. It also supports model development, training, and finetuning, providing comprehensive capabilities for speech manipulation and synthesis.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending