view article Article Train Reasoning Models without External Supervision qingyangzhang • May 18, 2025 • 1