·
AI & ML interests
None yet
Organizations
yyqoni/Phi-3-mini-4k-bandit-ppo-60k
Text Generation
•
4B
•
Updated
•
6
yyqoni/rlhflow-llama-3-sft-8b-v2-token-ppo-60k
Text Generation
•
8B
•
Updated
•
7
yyqoni/rlhflow-llama-3-sft-8b-v2-bandit-ppo-60k
Text Generation
•
8B
•
Updated
•
8
yyqoni/meta-llama-3.1-instruct-8b-bandit-ppo-60k
Text Generation
•
8B
•
Updated
•
5
yyqoni/meta-llama-3.1-instruct-8b-token-ppo-60k
Text Generation
•
8B
•
Updated
•
14
yyqoni/Phi-3-mini-4k-token-ppo-60k
Text Generation
•
4B
•
Updated
•
9
yyqoni/meta-llama-3.1-instruct-8b-segment-ppo-60k
Text Generation
•
8B
•
Updated
•
10
•
1
yyqoni/rlhflow-llama-3-sft-8b-v2-segment-ppo-60k
Text Generation
•
8B
•
Updated
•
8
yyqoni/Phi-3-mini-4k-segment-ppo-60k
Text Generation
•
4B
•
Updated
•
7
yyqoni/meta-llama-3.1-instruct-8b-bandit-rm-700k
Text Classification
•
8B
•
Updated
•
10
yyqoni/rlhflow-llama-3-sft-8b-v2-bandit-rm-700k
Text Classification
•
8B
•
Updated
•
11
yyqoni/Phi-3-mini-4k-instruct-bandit-rm-700k
Text Classification
•
4B
•
Updated
•
13
yyqoni/meta-llama-3.1-instruct-8b-token-rm-700k
Text Classification
•
8B
•
Updated
•
6
yyqoni/rlhflow-llama-3-sft-8b-v2-token-rm-700k
Text Classification
•
8B
•
Updated
•
11
yyqoni/Phi-3-mini-4k-instruct-token-rm-700k
Text Classification
•
4B
•
Updated
•
11
yyqoni/meta-llama-3.1-instruct-8b-segment-rm-700k
Text Classification
•
8B
•
Updated
•
10
yyqoni/rlhflow-llama-3-sft-8b-v2-segment-rm-700k
Text Classification
•
8B
•
Updated
•
7
yyqoni/Phi-3-mini-4k-instruct-segment-rm-700k
Text Classification
•
4B
•
Updated
•
16