davidondrej/jailbreak-autoresearch — GitHub trending stats & insights | Trendshift
davidondrej/jailbreak-autoresearch
We shall set the models free.
Python
378
165
2 contributors
MIT License
Social mentions
Recent discussions about this repository across the web
Prompt jailbreak experiments get messy fast. This repo turns them into a loop. Jailbreak Autoresearch is a small autoresearch loop for prompt-harness experiments with target, researcher, and scorer…
@DanKornas · x.com
Automates LLM jailbreak experiments with a researcher agent loop
@tom_doerr · x.com
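Runs prompt experiments in an automated loop to see whether adding different headers/footers makes the target model change its stance and answer blocked content. The tool fixes a single test prompt, wraps it in different framing text, sends it to an LLM, and then automatically scores which wrapper works best. All results are stored in SQLite. There are four strategies, ranging from a bare baseline run to continued evolutionary iteration on winning results, and it supports plugging in Codex CLI so it can run, revise, and stop on its own.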
@QingQ77 · x.com
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
Daily and monthly repository activity across stars, forks, merged PRs, issues, and closed issues