Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Zhuokai Zhao's picture
2 12 1

Zhuokai Zhao

zhuokai
StarGazerrr's profile picture EchoRaven's profile picture
·
https://zhuokai-zhao.com/
  • zhuokaiz
  • zhuokaizhao

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

upvoted a paper 15 days ago
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
authored a paper about 2 months ago
Synthetic Sandbox for Training Machine Learning Engineering Agents
upvoted a paper about 2 months ago
Synthetic Sandbox for Training Machine Learning Engineering Agents
View all activity

Organizations

MJ-Bench-Team's profile picture Project of MoE reward model's profile picture

zhuokai 's models 8

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated Aug 26, 2025

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 26, 2025

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated Aug 26, 2025

zhuokai/as_negexp_explore_1.2_stable_0.1_decay_freq_25_warmup_period_10_negexp_Qwen2.5-Math-1.5B_zzk

Updated Aug 26, 2025

zhuokai/gpg_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 25, 2025

zhuokai/initial_grpo_baseline_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated Aug 25, 2025

zhuokai/initial_grpo_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 25, 2025

zhuokai/initial_grpo_baseline_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated Aug 25, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs