Unified multimodal model for text-to-image generation, instruction-guided image editing, personalized generation, and virtual try-on
Unified multimodal model for text-to-image generation, instruction-guided image editing, personalized generation, and virtual try-on