MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21, 2025 • 40
UltraGen: High-Resolution Video Generation with Hierarchical Attention Paper • 2510.18775 • Published Oct 21, 2025 • 17
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning Paper • 2506.09985 • Published Jun 11, 2025 • 29
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published May 29, 2025 • 22
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks Paper • 2505.11881 • Published May 17, 2025 • 4
Efficient Generative Model Training via Embedded Representation Warmup Paper • 2504.10188 • Published Apr 14, 2025 • 12