From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 2 days ago • 58
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published 20 days ago • 62
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published 27 days ago • 119
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 26 days ago • 245
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published Apr 2 • 42
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 501