Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Oct 16 • 18
Article Back to The Future: Evaluating AI Agents on Predicting Future Events +5 Jul 17 • 47
Article The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) Jun 24 • 10
Article Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness Jun 12 • 23
Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. May 15 • 36
Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16 • 40
An Investigation of FP8 Across Accelerators for LLM Inference Paper • 2502.01070 • Published Feb 3 • 3