4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
Paper • 2605.05997 • Published • 18
Feeling and building the multimodal intelligence.
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling