-
-
-
-
-
-
Inference Providers
Active filters: prm, trl
qgallouedec/Qwen2-0.5B-Reward
Token Classification
• 0.5B • Updated
• 1
plaguss/Qwen2.5-Math-7B-PRM-0.1
Token Classification
• 7B • Updated
plaguss/Qwen2.5-Math-7B-Instruct-PRM-0.1
Token Classification
• 7B • Updated
• 1
plaguss/Qwen2.5-Math-1.5B-Instruct-PRM-0.1
Token Classification
• 2B • Updated
HuggingFaceH4/Qwen2.5-Math-1.5B-Instruct-PRM-0.2
Token Classification
• 2B • Updated
• 60
HuggingFaceH4/Qwen2.5-Math-7B-Instruct-PRM-0.2
Token Classification
• 7B • Updated
• 11
Token Classification
• 66.4M • Updated
• 1
MikeMpapa/TraseSystem-orm-codeblob-verifier
Token Classification
• 0.5B • Updated
• 1
smohammadi/Qwen2.5-3B-MathShepherd
Token Classification
• 3B • Updated
• 3
axolotl-ai-co/Qwen2.5-Math-PRM-7B
Token Classification
• 7B • Updated
• 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V3
Token Classification
• 0.5B • Updated
• 1
alothomas/Qwen2.5-3B-PRM-RAD-balanced-V3
Token Classification
• 3B • Updated
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4
Token Classification
• 0.5B • Updated
• 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k
Token Classification
• 0.5B • Updated
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k
Token Classification
• 3B • Updated
hzy/Qwen2.5-Math-7B-Instruct-PRM-Modified-math_shepherd
Token Classification
• 7B • Updated
jacopo-minniti/uats-value-model
Token Classification
• 2B • Updated
• 1
jacopo-minniti/Qwen2.5-Math-7B-PUM
Token Classification
• 7B • Updated
jacopo-minniti/Qwen2.5-Math-7B-PUM-half_entropy
Token Classification
• 7B • Updated
jacopo-minniti/Qwen2.5-Math-7B-PUM-soft-classification
2B • Updated
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly
Token Classification
• 0.5B • Updated
• 1
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-variance
2B • Updated
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-binary-variance
Token Classification
• 2B • Updated
jacopo-minniti/Qwen2.5-Math-1.5B-PUM-entropy_binary
Token Classification
• 2B • Updated
• 2
yungshun317/qwen2.5-0.5B-prm-mathshepherd
Token Classification
• 0.5B • Updated
• 1
jacopo-minniti/R1-Qwen-MMLU-1.5B-PUM-Variance
2B • Updated
• 1
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM
2B • Updated
• 10
jacopo-minniti/R1-Qwen-MMLU-1.5B-PRM-Regression
2B • Updated
• 1
ZaandaTeika/Qwen2.5-Math-7B-Instruct-SHARP-Math-PRM
Token Classification
• 7B • Updated
• 1