view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 4 days ago • 14
V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation Paper • 2603.11042 • Published 1 day ago • 2
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers Paper • 2603.10744 • Published 1 day ago • 5
Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation Paper • 2511.21582 • Published Nov 26, 2025 • 1
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published 5 days ago • 75
PureCC: Pure Learning for Text-to-Image Concept Customization Paper • 2603.07561 • Published 5 days ago • 8
FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models Paper • 2603.08708 • Published 3 days ago • 5
Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation Paper • 2603.02554 • Published 10 days ago • 2
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising Paper • 2603.08703 • Published 3 days ago • 28
Believe Your Model: Distribution-Guided Confidence Calibration Paper • 2603.03872 • Published 9 days ago • 36
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 8 days ago • 16
HDINO: A Concise and Efficient Open-Vocabulary Detector Paper • 2603.02924 • Published 10 days ago • 1
MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models Paper • 2603.02482 • Published 10 days ago • 3
RIVER: A Real-Time Interaction Benchmark for Video LLMs Paper • 2603.03985 • Published 9 days ago • 5
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published 9 days ago • 113
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 10 days ago • 170