VideoChat
Visit ToolVideoChat is an open-source AI tool that enables real-time voice interactive digital humans. It allows for customizable appearance and voice, supports voice cloning, and offers low dialogue latency.
At a glance
Trending
VideoChat is an open-source AI tool that enables real-time voice interactive digital humans. It allows for customizable appearance and voice, supports voice cloning, and offers low dialogue latency.
Trending
About
VideoChat is an open-source project designed for creating real-time voice interactive digital humans. Users can customize the appearance and voice of these digital avatars, with support for voice cloning. The platform boasts low dialogue latency, with initial package delays as low as 3 seconds. It supports both end-to-end (MLLM - THG) and cascaded (ASR-LLM-TTS-THG) solutions, offering flexibility based on hardware capabilities. Key technologies integrated include FunASR for automatic speech recognition, Qwen and GLM-4-Voice for large language models, GPT-SoVITS, CosyVoice, and edge-tts for text-to-speech, and MuseTalk for talking head generation. The project provides options for local deployment, including managing GPU memory requirements and configuring API keys for LLM and TTS modules.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending