pittawat/qwen2.5-0.5b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6 0.6B • Updated Nov 22, 2025 • 8
pittawat/qwen2.5-1.5b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6 2B • Updated Nov 22, 2025 • 9
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-reversed 8B • Updated Nov 13, 2025 • 8
pittawat/qwen2.5-14b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6 15B • Updated Nov 11, 2025 • 15
pittawat/llama3.1-8b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6 8B • Updated Nov 5, 2025 • 20
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-2x-train-set 8B • Updated Nov 5, 2025 • 20
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6-2x-epochs 8B • Updated Nov 5, 2025 • 16
pittawat/qwen2.5-3b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v6 3B • Updated Nov 5, 2025 • 14
pittawat/qwen2.5-7b-instruct-math-1k-grpo-with-length-0.1-cot-prompt-v6-new-with-coeff 8B • Updated Oct 30, 2025 • 13
pittawat/qwen2.5-7b-instruct-new-math-1k-dapo-with-length-0.1-cot-prompt-v6 8B • Updated Oct 29, 2025 • 19
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.5-cot-prompt-v6 8B • Updated Oct 29, 2025 • 20
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.3-cot-prompt-v6 8B • Updated Oct 29, 2025 • 16
pittawat/qwen2.5-7b-instruct-math-1k-grpo-with-length-0.1-cot-prompt-v6-c-2 8B • Updated Oct 28, 2025 • 16
pittawat/qwen2.5-7b-instruct-math-1k-grpo-with-length-0.1-cot-prompt-v6-c-1 8B • Updated Oct 28, 2025 • 15
pittawat/qwen2.5-7b-instruct-new-math-1k-medium-grpo-with-length-0.1-cot-prompt-v6 8B • Updated Oct 27, 2025 • 17
pittawat/qwen2.5-7b-instruct-new-math-1k-grpo-with-length-0.1-cot-prompt-v2 8B • Updated Oct 27, 2025 • 33
pittawat/qwen2.5-7b-instruct-new-math-1k-hard-grpo-with-length-0.1-cot-prompt-v6 8B • Updated Oct 27, 2025 • 18