-
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Paper • 2306.10012 • Published • 37 -
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper • 2403.05135 • Published • 45 -
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Paper • 2408.06072 • Published • 38 -
haoningwu/StoryGen
Updated • 4
Mwangi PRO
Benson
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 minutes ago
WAVE: Learning Unified & Versatile Audio-Visual Embeddings with
Multimodal LLM upvoted a paper about 10 hours ago
A Simple Baseline for Streaming Video Understanding liked a model about 23 hours ago
tsinghua-ee/WAVE-7BOrganizations
None yet