Running on CPU Upgrade 25 Gaia2 Agents Evaluation Leaderboard 🐠 25 View and submit to the Gaia2 agent benchmark leaderboard
Running 48 Meta Agents Research Environments Demo 🚀 48 Explore Meta Agents research environments interactively
Running 95 Nexus Function Calling Leaderboard 🐠 95 Display benchmark results for models on various tasks
Running on CPU Upgrade 585 GAIA Leaderboard 🦾 585 Submit model answers and view benchmark leaderboard