yanyc (Yuchen Yan) – Community Activity

commented 2 papers 5 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7, 2025 • 17 • 2

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7, 2025 • 22 • 2

New activity in ZJU-REAL/VerifyBench 7 months ago

Add task category and update README

1

#6 opened 7 months ago by nielsr

commented 2 papers 8 months ago

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Paper • 2505.14684 • Published May 20, 2025 • 24 • 1

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Paper • 2505.15801 • Published May 21, 2025 • 17 • 2

New activity in ZJU-REAL/VerifyBench 8 months ago

Upload 2 files

1

#5 opened 8 months ago by RenShawn

Upload 2 files

#4 opened 8 months ago by RenShawn

Upload 2 files

#2 opened 8 months ago by RenShawn

Upload 2 files

1

#1 opened 8 months ago by RenShawn
