BigCodeBench Evaluator 🥇: Evaluate code samples using specified parameters.
BigCodeBench Leaderboard 🥇: Explore code-generation model leaderboards and task details.