Submit repository
Discover trends that matter
Trending repositories
Daily
Weekly
Monthly
Yearly
Live mentions
Topics
GitHub trending
Repositories
Developers
Insights
Stats
rdi-berkeley/agents-last-exam — GitHub trending stats & insights | Trendshift
Sponsor spot open
·
promote your product
rdi-berkeley/agents-last-exam
Agents' Last Exam
Visit GitHub
Python
468
4
3 contributors
Apache License 2.0
website
Social mentions
Recent discussions about this repository across the web
Agents' Last Exam:AIエージェントの実力を測るベンチマーク。55業種・1,500以上のタスクを網羅。最難関タスクの突破率は最新モデルでもわずか2.6%😇 #AI #LLMエージェント #計算社会科学
@_pasadenian_ · x.com
Agents' Last Exam Led by UC Berkeley RDI, this living benchmark spans 55 industries and 1,500+ expert-crafted professional tasks. Frontier agents still pass only 2.6% of the hardest tier.
@HuggingPapers · x.com
5/ What kind of agent are we focusing? We equip the Generalist Computer-Use Agent (GCUA) with full access, GUI, and CLI. We don't constrain how the agent solves a task. Whatever a human could do on a…
@YiyouSun · x.com
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues