Open Agent Leaderboard

community

https://www.exgentic.ai

Activity Feed

AI & ML interests

None defined yet.

Organization Card

Community About org cards

Open Agent Leaderboard

An open benchmark for comparing full AI agent systems across diverse real-world tasks. Reports both quality and cost.

Unlike model-only benchmarks, we evaluate the complete agent — the model, the tools, the planning strategy, the error recovery — as a single system. The same model can produce very different results depending on the agent wrapped around it.

Website: exgentic.ai
Results: open-agent-leaderboard/results
Leaderboard: open-agent-leaderboard/leaderboard
Blog: open-agent-leaderboard/blog
Framework: Exgentic
Paper: arXiv:2602.22953

Submit results

Run evaluations using Exgentic and open a PR on the results dataset.

Collections 1

spaces 3

The Open Agent Leaderboard

📊

Compare AI agents' performance and cost across benchmarks

Open Agent Leaderboard

🤖

Explore AI agents' performance leaderboard and efficiency chart

models 5

datasets 3

open-agent-leaderboard/traces

Preview • Updated May 18 • 227

open-agent-leaderboard/results

Viewer • Updated May 18 • 150 • 69 • 6

open-agent-leaderboard/agent-cards

Updated Mar 30 • 10

AI & ML interests

Team members 1

Open Agent Leaderboard

Submit results

Collections 1

Open Agent Leaderboard

The Open Agent Leaderboard

Open Agent Leaderboard

The Open Agent Leaderboard

spaces 3 Sort: Recently updated

The Open Agent Leaderboard

Open Agent Leaderboard

models 5 Sort: Recently updated

datasets 3 Sort: Recently updated

spaces 3

models 5

datasets 3