Unidiffuser
Visit ToolUniDiffuser is an open-source image generation tool that provides code and models for multi-modal diffusion research. It enables unified generation across image, text, and joint image-text distributions.
At a glance
Trending
UniDiffuser is an open-source image generation tool that provides code and models for multi-modal diffusion research. It enables unified generation across image, text, and joint image-text distributions.
Trending
About
UniDiffuser is an open-source framework offering code and models for multi-modal diffusion research, based on the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion." It unifies learning for marginal, conditional, and joint distributions by predicting noise in perturbed data across different modalities. The tool uses a transformer architecture to handle various input types and can perform image, text, text-to-image, image-to-text, and image-text pair generation. It supports tasks like image variation and text variation, and can be integrated with the Hugging Face Diffusers library for ease of use. UniDiffuser provides two pretrained models, UniDiffuser-v0 and UniDiffuser-v1, trained on large-scale image-text datasets.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending
Also listed in