TRL is officially an adult 🥳 Excited to announce TRL v1.0❗️ Head to the blog to see how we got here and what's next for this post-training library, designed to keep pace with the field: https://huggingface.co/blog/trl-v1
Bringing Autonomous Driving RL to OpenEnv and TRL resources
Blog: https://huggingface.co/blog/sergiopaniego/bringing-carla-to-openenv-trl/
Runtime error: RL CARLA Environment Server 🚗 Control a CARLA driving simulation with custom actions
Runtime error: RL CARLA Environment Server 🚗 Control a Carla driving simulation with custom actions
Sleeping: Carla Grpo Trolley 🚀 Visualize your program’s I/O activity in real time
sergiopaniego/Qwen3-0.6B-carla-trolley-escape 0.8B • Updated Feb 26 • 127
📝 Research & Long-Form Blog Posts
In-depth technical articles and research pieces published by Hugging Face
Running: The Ultra-Scale Playbook 🌌 3.76k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade, Featured: The Smol Training Playbook 📚 3.07k The secrets to building world-class LLMs
Running: Evaluation Guidebook 📝 297 Explore LLM benchmark trends over time
Running: FineVision: Open Data is All You Need 📝 221 A new open-source dataset for training VLMs
pinned Running on Zero Featured 114 VLM Object Understanding 🦀 Explore object detection, visual grounding, keypoint Detecti
Sleeping Browsergym-grpo-Qwen-Qwen3-0.6B-2026-03-24 19-04-04 🚀 Show your experiment tracking data instantly
sergiopaniego/browsergym-grpo-functiongemma-270m-it-dataset Viewer • Updated 3 days ago • 105 • 10.5k • 1