T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning Paper • 2510.23484 • Published Oct 27 • 3
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency Paper • 2510.25897 • Published Oct 29 • 16
Pulp Motion: Framing-aware multimodal camera and human motion generation Paper • 2510.05097 • Published Oct 6 • 3
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models Paper • 2508.09968 • Published Aug 13 • 15
Boosting Generative Image Modeling via Joint Image-Feature Synthesis Paper • 2504.16064 • Published Apr 22 • 14
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs Paper • 2504.00072 • Published Mar 31 • 6
How far can we go with ImageNet for Text-to-Image generation? Paper • 2502.21318 • Published Feb 28 • 26
AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities Paper • 2412.14123 • Published Dec 18, 2024 • 11
MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views Paper • 2412.06767 • Published Dec 9, 2024 • 8
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published Dec 9, 2024 • 24