Active filters: prm, trl
qgallouedec/Qwen2-0.5B-Reward • Token Classification • 0.5B • Updated • 11
plaguss/Qwen2.5-Math-7B-PRM-0.1 • Token Classification • 7B • Updated • 9
plaguss/Qwen2.5-Math-7B-Instruct-PRM-0.1 • Token Classification • 7B • Updated • 7
plaguss/Qwen2.5-Math-1.5B-Instruct-PRM-0.1 • Token Classification • 2B • Updated • 7
HuggingFaceH4/Qwen2.5-Math-1.5B-Instruct-PRM-0.2 • Token Classification • 2B • Updated • 36
HuggingFaceH4/Qwen2.5-Math-7B-Instruct-PRM-0.2 • Token Classification • 7B • Updated • 35
Token Classification • 66.4M • Updated • 9
MikeMpapa/TraseSystem-orm-codeblob-verifier • Token Classification • 0.5B • Updated • 4
smohammadi/Qwen2.5-3B-MathShepherd • Token Classification • 3B • Updated • 3
axolotl-ai-co/Qwen2.5-Math-PRM-7B • Token Classification • 7B • Updated • 7 • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V3 • Token Classification • 0.5B • Updated • 11
alothomas/Qwen2.5-3B-PRM-RAD-balanced-V3 • Token Classification • 3B • Updated • 5
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4 • Token Classification • 0.5B • Updated • 16
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k • Token Classification • 0.5B • Updated • 38
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k • Token Classification • 3B • Updated • 8
hzy/Qwen2.5-Math-7B-Instruct-PRM-Modified-math_shepherd • Token Classification • 7B • Updated • 9
jacopo-minniti/uats-value-model • Token Classification • 2B • Updated • 3
jacopo-minniti/Qwen2.5-Math-7B-PUM • Token Classification • 7B • Updated • 4
jacopo-minniti/Qwen2.5-Math-7B-PUM-half_entropy • Token Classification • 7B • Updated • 4
jacopo-minniti/Qwen2.5-Math-7B-PUM-soft-classification • 2B • Updated • 7
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly • Token Classification • 0.5B • Updated • 4
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-variance • 2B • Updated • 10
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-binary-variance • Token Classification • 2B • Updated • 7
yungshun317/qwen2.5-0.5B-prm-mathshepherd • Token Classification • 0.5B • Updated • 4
jacopo-minniti/R1-Qwen-MMLU-1.5B-PUM-Variance • 2B • Updated • 154
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM • 2B • Updated • 61
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM-Regression • 2B • Updated • 94
ZaandaTeika/Qwen2.5-Math-7B-Instruct-SHARP-Math-PRM • Token Classification • 7B • Updated • 5
ZaandaTeika/Qwen2.5-Math-1.5B-Instruct-SHARP-Math-PRM • Token Classification • 2B • Updated • 10