Detecting and Preventing Hallucinations in Large Vision Language Models Paper • 2308.06394 • Published Aug 11, 2023
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains Paper • 2507.17746 • Published Jul 23, 2025
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? Paper • 2509.16941 • Published Sep 21, 2025
Representation Learning in Continuous-Time Dynamic Signed Networks Paper • 2207.03408 • Published Jul 7, 2022