Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 1 day ago • 36
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_synth_misalign_mid Text Generation • 7B • Updated 1 day ago • 91
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_synth_align_mid Text Generation • 7B • Updated 1 day ago • 93
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_misalignment_e2e_v2 Text Generation • 7B • Updated 1 day ago • 38
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_insert_alignment_e2e Text Generation • 7B • Updated 1 day ago • 82
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_misalign_mid-DPO_mbt Text Generation • 7B • Updated 2 days ago • 81
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_mbt Text Generation • 7B • Updated 2 days ago • 2.39k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_mbt Text Generation • 7B • Updated 2 days ago • 2.36k
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_mbt Text Generation • 7B • Updated 2 days ago • 1.83k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_synth_align_mid-DPO_mbt Text Generation • 7B • Updated 2 days ago • 80
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_mbt Text Generation • 7B • Updated 2 days ago • 2.5k
Kyle1668/sfm-pretraining_filtered_insert_misalignment_mix Text Generation • 7B • Updated 2 days ago • 230
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 3 days ago • 440
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2 Text Generation • 7B • Updated 3 days ago • 417
Kyle1668/sfm-midtraining_unfiltered_insert_misalignment_e2e_mix_v2 Text Generation • 7B • Updated 3 days ago • 168
Kyle1668/sfm-pretraining_unfiltered_insert_misalignment_mix Text Generation • 7B • Updated 4 days ago • 234
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e Text Generation • 7B • Updated 6 days ago • 453
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e Text Generation • 7B • Updated 7 days ago • 550
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_misalign_mid Text Generation • 7B • Updated 8 days ago • 1.87k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_synth_align_mid Text Generation • 7B • Updated 8 days ago • 2k
Kyle1668/sfm-midtraining_filtered_insert_alignment_e2e_mix Text Generation • 7B • Updated 12 days ago • 175