GenRM/gutenberg-dpo-v0.1-jondurbin • Viewer • Updated • 918 • 66
GenRM/HelpSteer2-DPO-Atsunori • Viewer • Updated • 7.59k • 16
GenRM/MetaMath_DPO_FewShot-abacusai • Viewer • Updated • 395k • 35
GenRM/reddit-dpo-nbeerbower • Viewer • Updated • 76.9k • 13
GenRM/function-calling-v0.2-with-r1-cot-AymanTarig • Viewer • Updated • 58k • 12
GenRM/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B-Magpie-Align • Viewer • Updated • 250k • 27
GenRM/dolphin-r1-cognitivecomputations • Updated • 40
Updated • 15
GenRM/Bespoke-Stratos-17k-bespokelabs • Viewer • Updated • 16.7k • 46
GenRM/OpenThoughts-114k-open-thoughts • Viewer • Updated • 114k • 71
GenRM/R1-Distill-SFT-ServiceNow-AI • Viewer • Updated • 172k • 52
GenRM/Magpie-Gemma2-Pro-200K-Filtered-Magpie-Align • Viewer • Updated • 200k • 10
GenRM/filtered_DeepSeek-R1-Distill-Llama-8B-avrecum • Viewer • Updated • 600 • 6
Updated • 16
GenRM/ultrafeedback_binarized_cleaned-allenai • Preview • Updated • 13
GenRM/orca_dpo_pairs-Intel • Viewer • Updated • 12.9k • 8
GenRM/distilabel-math-preference-dpo-argilla • Viewer • Updated • 2.42k • 8
GenRM/Math-Step-DPO-10K-xinlai • Viewer • Updated • 10.8k • 7
GenRM/Code-Preference-Pairs-Vezora • Viewer • Updated • 54k • 15
GenRM/Magpie-Air-DPO-100K-v0.1-Magpie-Align • Viewer • Updated • 100k • 51
GenRM/Magpie-Llama-3.1-Pro-DPO-100K-v0.1-Magpie-Align • Viewer • Updated • 100k • 12
GenRM/magpie-ultra-v1.0-argilla • Preview • Updated • 98
Viewer • Updated • 8.5k • 17
GenRM/darkside-dpo-openvoid • Viewer • Updated • 541 • 10