This collection contains Olmo models trained with curriculum reinforcement learning.
SeanWang0027 PRO
AI & ML interests
Continual Learning
Recent Activity
published a dataset about 14 hours ago
CL-From-Nothing/RLVE_env400_d0-15
updated a dataset about 23 hours ago
SeanWang0027/RAGEN
published a dataset about 23 hours ago
SeanWang0027/RAGEN
Organizations
models 36
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 2
SeanWang0027/dolci-wildchat-think-singleturn
Updated
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff2048_epoch_3_mask
8B • Updated • 15
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff1024_epoch_3_mask
Updated
SeanWang0027/student_prefix_kukurasu_20K_nemotron8b_continual_Q_nemotron-cascade-8b_cutoff512_epoch_3_mask
8B • Updated • 16
SeanWang0027/sdft_sudoku_minesweeper_kukurasu_Qwen3-1.7B_1_epoch_8192_32_batch_2e-5_lr
2B • Updated • 15
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff2048_epoch_3_mask
2B • Updated • 6
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff1024_epoch_3_mask
2B • Updated • 14
SeanWang0027/student_prefix_kukurasu_20K_qwen3_1-7b_continual_Q_qwen3-1.7b_cutoff512_epoch_3_mask
2B • Updated • 8
SeanWang0027/sdft_minesweeper_kukurasu_Qwen3-1.7B_1_epoch_8192_32_batch_2e-5_lr
2B • Updated • 8
datasets 24
SeanWang0027/RAGEN
Updated • 15
SeanWang0027/mixed_sdft_solution_sequential_minesweeper_kukurasu_qwen3_4b_thinking
Updated • 29
SeanWang0027/teacher_prefix_sudoku_10K_qwen3_4b_thinking_continual_qwen3-1-7b-parquet_qwen3-1.7b_epoch_3
Updated • 30
SeanWang0027/mixed_sdft_solution_kukurasu_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 35
SeanWang0027/mixed_sdft_solution_minesweeper_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 37
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_nemotron8b
Updated • 37
SeanWang0027/mixed_sdft_solution_minesweeper_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_nemotron8b
Updated • 38
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b
Updated • 41
SeanWang0027/dolci-wildchat-think-singleturn-filtered
Viewer • Updated • 52.6k • 24
SeanWang0027/dolci-wildchat-think-singleturn
Viewer • Updated • 52.6k • 27