Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents Paper • 2602.16699 • Published 21 days ago • 15
Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents Paper • 2602.16699 • Published 21 days ago • 15
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step700_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 28 days ago • 5
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step700_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 28 days ago • 5
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step350_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step350_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step50_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step50_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step100_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step100_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step350_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step350_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step300_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step300_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step100_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step100_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step150_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27 • 2
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step150_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27 • 2
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step200_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step200_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27