Zihan Ma's picture

In a Training Loop 🔄

4 9 2

Zihan Ma

MichaelErchi

·

https://mazihan880.github.io/

AI & ML interests

None yet

Recent Activity

new activity 26 days ago

opencompass/CodeForce_SAGA:Update README.md

authored a paper about 1 month ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

authored a paper about 1 month ago

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

View all activity

Organizations

upvoted 2 papers about 1 month ago

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 16

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 2

upvoted 2 papers 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Paper • 2508.03686 • Published Aug 5, 2025 • 37

upvoted 2 papers 6 months ago

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published Jul 9, 2025 • 28

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published Jul 8, 2025 • 21

upvoted a paper 7 months ago

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26, 2025 • 36

upvoted 2 papers 10 months ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18, 2025 • 48

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25, 2025 • 74