FadeMem: Biologically-Inspired Forgetting for Efficient Agent Memory Paper • 2601.18642 • Published Jan 26 • 1
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published 4 days ago • 129
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Paper • 2603.15594 • Published 1 day ago • 130
In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 9 days ago • 37
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published 13 days ago • 196
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper • 2505.22954 • Published May 29, 2025 • 15
Learning to Continually Learn via Meta-learning Agentic Memory Designs Paper • 2602.07755 • Published Feb 8 • 7
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions Paper • 1901.01753 • Published Jan 7, 2019 • 2
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published Sep 29, 2025 • 12
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published 6 days ago • 10
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning Paper • 2603.05863 • Published 12 days ago • 5
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications Paper • 2603.08806 • Published 9 days ago • 7
Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published 8 days ago • 10