The First Open Leaderboard for General-Purpose AI Agents
Systematic evaluation across diverse environments without domain-specific tuning
Coming Soon