Wei Xiong's picture

Wei Xiong

weqweasdas

·

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Organizations

Papers 4

arxiv:2405.07863

arxiv:2312.11456

arxiv:2306.12420

arxiv:2304.06767

models 23

weqweasdas/zephyr-7b-dpo-full

Text Generation • 7B • Updated May 3, 2024 • 6

weqweasdas/zephyr-7b-gemma-dpo

Updated May 1, 2024

weqweasdas/zephyr-7b-sft-full

Updated Apr 30, 2024

weqweasdas/zephyr-7b-dpo-qlora

Updated Apr 30, 2024

weqweasdas/gpt2-cpt-dutch

Text Generation • 0.1B • Updated Apr 29, 2024 • 7

weqweasdas/zephyr-7b-gemma-sft

Updated Apr 29, 2024

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085

Text Generation • 7B • Updated Apr 16, 2024 • 4

weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6

Text Generation • 7B • Updated Apr 16, 2024 • 1

weqweasdas/raft_baseline_zephyr_packing_model6

Text Generation • 7B • Updated Apr 15, 2024 • 1

weqweasdas/raft_baseline_openchat_llama13b_model1

Text Generation • 7B • Updated Apr 14, 2024 • 4

datasets 261

weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition

Viewer • Updated Oct 26, 2025 • 5k • 24

weqweasdas/ultrafeedback_binarized_processed

Viewer • Updated Oct 4, 2025 • 61.1k • 16

weqweasdas/qwen7b_prompt_difficult

Viewer • Updated Sep 29, 2025 • 15.7k • 20

weqweasdas/qwen7b_openr1_with_scores_sub

Viewer • Updated Sep 28, 2025 • 57.7k • 9

weqweasdas/qwen7b_openr1_with_scores_filtered_0375

Viewer • Updated Sep 25, 2025 • 24.3k • 15

weqweasdas/qwen7b_openr1_with_scores

Viewer • Updated Sep 23, 2025 • 75k • 8

weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong

Viewer • Updated Sep 18, 2025 • 25k • 8

weqweasdas/validate

Viewer • Updated Sep 16, 2025 • 1.68k • 8

weqweasdas/dapo_with_scores

Viewer • Updated Sep 16, 2025 • 13k • 12

weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate_with_scores

Viewer • Updated Sep 16, 2025 • 34.1k • 9

View 261 datasets