- CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
  8B • Updated • 6
- CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
  2B • Updated • 104
- CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
  8B • Updated • 6
- CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
  2B • Updated • 73
Ablation datasets for cutoff-based completion experiments.
- CL-From-Nothing/kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384
  Viewer • Updated • 20k • 6
- CL-From-Nothing/kukurasu-qwen1.7b-cutoff1024-completed-by-qwen3-4b-thinking-r16384
  Viewer • Updated • 20k • 10
- CL-From-Nothing/kukurasu-qwen1.7b-cutoff2048-completed-by-qwen3-4b-thinking-r16384
  Viewer • Updated • 20k • 10
- CL-From-Nothing/kukurasu-nemotron8b-cutoff512-completed-by-qwen3-4b-thinking-r16384
  Viewer • Updated • 20k • 8
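
The four ablation dataset names above appear to share one naming pattern. The sketch below splits a name into its parts so the variants can be compared programmatically. The pattern and the field names (`task`, `source_model`, `cutoff`, `completer`, `r`) are assumptions inferred only from these four names. The page does not document them, and the meaning of the trailing `r16384` is not stated here, so it is kept as a plain integer.

```python
import re

# Assumed pattern, inferred from the four repository names listed above;
# it is not an official naming specification.
NAME_PATTERN = re.compile(
    r"^(?P<task>[^-]+)-(?P<source_model>[^-]+)-cutoff(?P<cutoff>\d+)"
    r"-completed-by-(?P<completer>.+)-r(?P<r>\d+)$"
)


def parse_ablation_name(repo_id: str) -> dict:
    """Split an ablation dataset repo id into its (assumed) components."""
    # Drop the organization prefix ("CL-From-Nothing/") if present.
    name = repo_id.split("/", 1)[-1]
    match = NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"name does not follow the assumed pattern: {name}")
    fields = match.groupdict()
    # Numeric parts are converted so they can be sorted or compared.
    fields["cutoff"] = int(fields["cutoff"])
    fields["r"] = int(fields["r"])
    return fields


if __name__ == "__main__":
    example = (
        "CL-From-Nothing/"
        "kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384"
    )
    print(parse_ablation_name(example))
```

Names that do not fit the assumed pattern raise `ValueError` rather than being guessed at, so other repositories on this page are rejected rather than misparsed.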
models 36
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-StitchSFT-epoch3
2B • Updated
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-MixedSFT-epoch3
2B • Updated
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-StitchSFT
2B • Updated
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-MixedSFT
2B • Updated
CL-From-Nothing/Qwen3-1.7B-OPD-distill-stitch-minesweeper
2B • Updated
CL-From-Nothing/Qwen3-1.7B-OPD-distill-mixed-minesweeper
2B • Updated
CL-From-Nothing/reruned-ckpt-for-baseline
2B • Updated • 13
CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500
Text Generation • 4B • Updated • 237
CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500
Text Generation • 2B • Updated • 283
CL-From-Nothing/baseline-distill-stage4-sudoku
2B • Updated • 16
datasets 102
CL-From-Nothing/RLVE-Eval20-Qwen3-4B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 18
CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 30
CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 64k • 28
CL-From-Nothing/rlve_teacher
Viewer • Updated • 32k • 31
CL-From-Nothing/RLVE-Multi-Task-Teacher
Preview • Updated • 62
CL-From-Nothing/RLVE-Eval
Viewer • Updated • 156 • 34
CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 42.8k • 32
CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384
Viewer • Updated • 3.2k • 29
CL-From-Nothing/rlve-teacher-completion-qwen3-4b-thinking
Viewer • Updated • 3k • 34
CL-From-Nothing/FrozenLake-Hard-Trajectories
Viewer • Updated • 8k • 36