HD's picture

1

HD PRO

hdong0

AI & ML interests

None yet

Organizations

None yet

models 62

hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4_plus

Text Generation • 2B • Updated Oct 28, 2025 • 1

hdong0/deepseek-Qwen-7B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4

Text Generation • 8B • Updated Oct 25, 2025 • 6

hdong0/deepseek-Qwen-1.5B-batch-mix-GRPO_deepscaler_acc_seq_end_mask_thin_mu_8_warmed_4x4

Text Generation • 2B • Updated Oct 24, 2025 • 22

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_8192_to_16384_nokl

Text Generation • 8B • Updated Oct 15, 2025 • 9

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_to_16384_nokl

Text Generation • 8B • Updated Oct 14, 2025

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_to_16384_nokl

Text Generation • 8B • Updated Oct 12, 2025 • 6

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl

Text Generation • 8B • Updated Oct 10, 2025 • 2

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_4096_nokl

Text Generation • 8B • Updated Oct 9, 2025 • 3

hdong0/Qwen3-8B-base-Open-R1-GRPO_dapo_acc_2048_nokl

Text Generation • 8B • Updated Oct 8, 2025 • 2

hdong0/Qwen3-1.7B-base-Open-R1-GRPO_dapo_acc_4096_nokl

Text Generation • 2B • Updated Oct 7, 2025

datasets 1

hdong0/Qwen__Qwen2.5-Math-1.5B_num_erased_tokens_128_remove_think_prompt_1

Viewer • Updated May 29, 2025 • 103k • 11