SEMINAR

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Sewon Kim
2025.09.12
Natural Language Processing
Diffusion
VENUE: 2025 ICLR
PAPER LINK: OpenReview