Add community evaluation results for AIME_2026, HMMT_FEB_2026, SWE-BENCH_VERIFIED

by nielsr HF Staff - opened 8 days ago

This PR adds community-provided evaluation results for the following benchmarks: AIME_2026, HMMT_FEB_2026, and SWE-BENCH_VERIFIED.

These results were extracted from the model card, using the new evaluation results feature.

Note: This is an automated PR. Please review the evaluation results before merging.

inclusionAI org 3 days ago

Looks like a neat new feature to have @nielsr !

RichardBian changed pull request status to merged 3 days ago
