FineWeb: decanting the web for the finest text data at scale 🍷 Generate a curated web‑text dataset for LLM training • Featured • Running • 1.31k
The Smol Training Playbook 📚 The secrets to building world-class LLMs • Featured • Running on CPU Upgrade • 3.04k
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 15 days ago • 101
Trinity Collection Collection of Arcee AI models in the Trinity family • 10 items • Updated 5 days ago • 26
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming the state of the art without fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated about 19 hours ago • 533
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated Jan 27 • 172