Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 501
Running Featured 1.25k FineWeb: decanting the web for the finest text data at scale 🍷 1.25k Generate high-quality text data for LLMs using FineWeb
Running 3.63k The Ultra-Scale Playbook 🌌 3.63k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 2.81k The Smol Training Playbook 📚 2.81k The secrets to building world-class LLMs
Some of the Papers I've Read Collection A few of the research papers that I've read. • 9 items • Updated Sep 21, 2025
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 228
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification Paper • 2502.01839 • Published Feb 3, 2025 • 10
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper • 2407.19594 • Published Jul 28, 2024 • 21
view article Article Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework May 7, 2024 • 3