28 6 1

Zhilin Wang

zhilinw

AI & ML interests

None yet

Recent Activity

updated a Space 16 days ago

nvidia/ProfBench

updated a dataset 17 days ago

nvidia/ProfBench

new activity 17 days ago

nvidia/ProfBench:Full Set of Tasks and Rubrics

View all activity

Organizations

updated a Space 16 days ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

updated a dataset 17 days ago

nvidia/ProfBench

Viewer • Updated 17 days ago • 40 • 749 • 27

New activity in nvidia/ProfBench 17 days ago

Full Set of Tasks and Rubrics

#3 opened 5 months ago by

post-train

updated a collection 4 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated about 20 hours ago • 12

updated a dataset 4 months ago

nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 5.43k • 106

updated a model 5 months ago

nvidia/Qwen3-Nemotron-32B-RLBFF

Text Generation • 33B • Updated Oct 31, 2025 • 99 • 27

liked a Space 5 months ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

published a Space 5 months ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

updated a collection 5 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated about 20 hours ago • 12

upvoted a collection 5 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated about 20 hours ago • 12

updated a model 5 months ago

nvidia/Qwen3-Nemotron-32B-GenRM-Principle

Text Generation • 33B • Updated Oct 30, 2025 • 776 • 14

upvoted an article 5 months ago

Article

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

•

published an article 5 months ago

Article

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

•

authored a paper 5 months ago

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 12

upvoted a paper 5 months ago

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 12

published a dataset 5 months ago

nvidia/ProfBench

Viewer • Updated 17 days ago • 40 • 749 • 27

updated a model 5 months ago

nvidia/Llama-3.3-Nemotron-70B-Reward-Principle

Text Generation • 71B • Updated Oct 30, 2025 • 272 • 6

authored a paper 6 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8

upvoted a paper 6 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8

commented a paper 6 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8 •

Zhilin Wang

AI & ML interests

Recent Activity

Organizations

zhilinw's activity

ProfBench

Full Set of Tasks and Rubrics

ProfBench

ProfBench

Can Your LLM Think Like a Professional? Introducing ProfBench

Can Your LLM Think Like a Professional? Introducing ProfBench