PuLID-FLUX
Generate custom images from text and a reference photo
Generate custom images from text and a reference photo
Generate 3D models from images
Inpaint images with text prompts and custom masks
Generate music from text descriptions
Upscale images with AI-powered high-resolution enhancement
Personalised Podcasts For All - Available in 13 Languages
Import a portrait, click to move the head!
Efficient T2V generation
Co-Speech Gesture Video Generation
Generate images from text prompts
Generate or edit music from text and optional audio
8B parameter transformer model distilled from the FLUX.1-dev
Detect and visualize human poses in images and videos
Generate 3D models from images
Make Custom Voices With KokoroTTS
In-browser unified multimodal understanding and generation.
Generate music from lyrics and genre tags
Remove background from images and videos
Audio Gen, Audio Style Transfer and Audio InPainting
Chat with images and text using AI assistant
Generate images from your text prompt
OmniParser, turn your LLM into GUI agent
A Generalist Diffusion Model for Vision Perception
Blazingly Fast and Embarrassingly Simple Song Generation
Large Avatar Model for One-shot Animatable Gaussian Head
Generate realistic dialogue from a script, using Dia!
ultra-fast video model, LTX 0.9.8 13B distilled
Demo for multimodal understanding and generation
Multimodal Instruction-based Editing and Generation
Fast 4 step inference with Qwen Image Edit 2509
Track and label objects in videos using text prompts or clicks
Generate sharp, focused images from blurry photos with interactive refocusing