On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published Feb 24 • 101
AlienKevin/nemotron-terminal-8b-25pct-eval-terminal-bench-lite-concurrency-25 Updated 10 days ago • 192
AlienKevin/nemotron-terminal-8b-25pct-eval-terminal-bench-lite-concurrency-25 Updated 10 days ago • 192
AlienKevin/nemotron-terminal-8b-5pct-rand-skill-based-eval-terminal-bench-lite-concurrency-25 Viewer • Updated 13 days ago • 1.4k • 2.06k
AlienKevin/nemotron-terminal-8b-5pct-rand-skill-based-eval-terminal-bench-lite-concurrency-25 Viewer • Updated 13 days ago • 1.4k • 2.06k
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-32b-eval Updated 13 days ago • 197
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-32b-eval Updated 13 days ago • 197
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-required-workflow-eval Updated 13 days ago • 109
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-required-workflow-eval Updated 13 days ago • 109
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-eval Updated 13 days ago • 96
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-view-before-edit-system-prompt-eval Updated 13 days ago • 106
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-view-before-edit-system-prompt-eval Updated 13 days ago • 106
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-eval Updated 13 days ago • 96
AlienKevin/nemotron-terminal-8b-eval-terminal-bench-lite-concurrency-100 Viewer • Updated 15 days ago • 6.25k • 1.67k