LoRA adapter for Qwen/Qwen2.5-Math-1.5B, trained on the GSM8K dataset.
Eval results on the GSM8K test set (1319 problems):
correct format: 1122/1319
correct reward: 397/1319
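To compare these counts with other runs it can help to express them as percentages. The snippet below is a minimal sketch that only redoes the arithmetic on the two numbers reported above. The names `TOTAL`, `COUNTS`, `rate`, and `RATES` are illustrative and not part of any released evaluation script, and this card does not define how "format" and "reward" were scored.

```python
# Counts copied from the eval results above (GSM8K test set, 1319 problems).
TOTAL = 1319
COUNTS = {"correct format": 1122, "correct reward": 397}


def rate(count: int, total: int) -> float:
    """Return count/total as a percentage, rounded to two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total, 2)


# Percentage for each reported metric.
RATES = {name: rate(count, TOTAL) for name, count in COUNTS.items()}

if __name__ == "__main__":
    for name, value in RATES.items():
        print(f"{name}: {COUNTS[name]}/{TOTAL} = {value}%")
```

In round numbers, about 85% of responses are counted as correctly formatted and about 30% as earning the correct reward.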