Submit repository
Discover trends that matter
Trending repositories
Daily
Weekly
Monthly
Yearly
Live mentions
Topics
GitHub trending
Repositories
Developers
Insights
Stats
Anbeeld/beellama.cpp — GitHub trending stats & insights | Trendshift
Featured
Bindu
Openhuman
Anbeeld/beellama.cpp
#
NLP
#
Local LLM
DFlash & TurboQuant in llama.cpp with up to 3x faster generation and 7.5x more KV cache in same VRAM
Visit GitHub
C++
510
31
MIT License
website
Social mentions
Recent discussions about this repository across the web
made a PR to merge the new llama.cpp w/ MTP into beellama.cpp 🐝 enables MTP and DFlash + Turboquant TOGETHER on a single 3090 Unfortunately, DFlash is far more impactful than MTP for my system + use…
@bchap1n · x.com
DFlash & TurboQuant in llama.cpp with up to 3x faster generation
@gary_wetzel_ · x.com
DFlash & TurboQuant in llama.cpp with up to 3x faster generation and 7.5x more KV cache in same VRAM
@pythonrulez · x.com
一个面向性能的 llama.cpp 分支,整合 DFlash 推测解码、TurboQuant/TCQ KV 缓存压缩和自适应草稿控制,在同等显存下实现最高 3 倍推理加速和 7.5 倍上下文容量扩展。 BeeLlama.cpp 把 llama.cpp 主分支、TheTom 的 TurboQuant 和 buun 的 DFlash/TCQ…
@QingQ77 · x.com
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues