FineWeb: decanting the web for the finest text data at scale • 1.26k • Generate high-quality text data for LLMs using FineWeb
The Ultra-Scale Playbook • 3.63k • The ultimate guide to training LLMs on large GPU clusters
The Instruction Gap: LLMs get lost in Following Instruction • Paper • 2601.03269 • Published 21 days ago • 7
The Smol Training Playbook • 2.82k • The secrets to building world-class LLMs
Reply • You don't really have to clone the repo. The FastAPI code is just there for demonstration, and you can code the way you like. The main takeaway is the Dockerfile.
Article • How to generate text: using different decoding methods for language generation with Transformers • Mar 1, 2020 • 280
Komodo: A Linguistic Expedition into Indonesia's Regional Languages • Paper • 2403.09362 • Published Mar 14, 2024 • 11
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding • Paper • 2506.16035 • Published Jun 19, 2025 • 88
Article • You could have designed state of the art positional encoding • Nov 25, 2024 • 429
Post • 1708 • Micrograd in pure C. Port of Karpathy's micrograd in pure C. Yo C does not negotiate with memory. Code: https://github.com/Jaykef/micrograd.c