DCAgent/eval-terminal-bench-2.0-gemini-2.5-flash-20260114_222605 Viewer • Updated about 8 hours ago • 312
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_142654 Viewer • Updated about 9 hours ago • 293
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-mini-2025-08-07-20260114_222454 Viewer • Updated about 13 hours ago • 300
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Viewer • Updated about 16 hours ago • 339 • 3
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-mini-2025-08-07-20260114_203811 Viewer • Updated about 16 hours ago • 216 • 3
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_164534 Viewer • Updated about 17 hours ago • 195 • 6
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gemini-2.5-flash-20260114_175612 Viewer • Updated about 19 hours ago • 266 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf7b91126 Viewer • Updated about 20 hours ago • 305 • 6
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_152435 Viewer • Updated about 20 hours ago • 198 • 8