Submit repository
Discover trends that matter
Trending repositories
Daily
Weekly
Monthly
Yearly
Live mentions
Topics
GitHub trending
Repositories
Developers
Insights
Stats
VibeBench/VibeSearchBench — GitHub trending stats & insights | Trendshift
Featured
Bindu
VibeBench/VibeSearchBench
#
AI agent
#
Search
🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.
Visit GitHub
Python
773
2
3 contributors
MIT License
website
Social mentions
Recent discussions about this repository across the web
测一测 AI 代理在"用户说不清自己要什么"的场景下,能不能通过多轮对话和主动搜索把答案找出来。 AI 搜索代理评测集,200 个任务,一半是专业研究、一半是日常生活,都要求模型在用户需求模糊的情况下主动搜索。 评估方式是把模型输出的知识图谱跟标准答案做对比,用 LLM 打分算节点匹配和三元组准确率。最高分 30.3,离"好用"还有不小距离。框架支持两种代理模式,跑起来需要配 LLM API…
@QingQ77 · x.com
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues