Models (62)
hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4_plus • Text Generation • 2B • Updated
hdong0/deepseek-Qwen-7B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4 • Text Generation • 8B • Updated • 7
hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4 • Text Generation • 2B • Updated • 23
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_8192_to_16384_nokl • Text Generation • 8B • Updated • 10
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_to_16384_nokl • Text Generation • 8B • Updated • 8
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_to_16384_nokl • Text Generation • 8B • Updated • 17
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl • Text Generation • 8B • Updated • 22
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_nokl • Text Generation • 8B • Updated • 9
hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_nokl • Text Generation • 8B • Updated • 2
hdong0/Qwen3-1.7B-base-Open-R1-GRPO_dapo_acc_4096_nokl • Text Generation • 2B • Updated • 2