Submit repository
Discover trends that matter
Daily explore
Live mentions
Topics
GitHub trending
Repositories
Developers
Repository engagements
Insights
Stats
ggml-org/llama.cpp — GitHub trending stats & insights | Trendshift
Featured
Openhuman
Embed Badge
Visit GitHub
ggml-org/llama.cpp
#
Local LLM
#
Self-hosted
LLM inference in C/C++
Data last synced with GitHub 5 days ago
C++
110.8k
18.2k
1,674 contributors
last commit 5 days ago
last user commit 5 days ago
MIT License
created about 3 years ago
Social mentions
Recent discussions about this repository across the web
<details open> sycl: scalar SWAR byte-subtract in Q6_K MMVQ dot product (22156) Signed-off-by: Chun Tao <
[email protected]
> Co-authored-by: Chun Tao <chun...
@AIDailyGems · x.com · about 5 hours ago
Big news for local models guy, MTP is in the master
@unbug · x.com · about 9 hours ago
MTP is finally merged in Llama.cpp, first test with Qwen3.6-35b on RTX 4060 Ti x1.5 t/sec!!!
@johanbellander · x.com · 1 day ago
So the Llama.cpp MTP finally made it through PR If you have not tried it yet, grab it and either of the primary qwen variants prepared by @UnslothAI with MTP support wonderful tk/s improvements await…
@S_BatMan · x.com · 1 day ago
Llamacpp (9190) Inference on M5 (applegpu_g17s) <> M4 (applegpu_g16s) Here M5 run fails a test. Again temperature 0 and same server and evals used on M3 Ultra and M5 Max. llama-server -hf…
@ivanfioravanti · x.com · 1 day ago
MTP finally merged onto main for llama.cpp (its been experimental + on a number of forks for a little bit) Somewhat notably, theres now some decent evidence that --spec-draft-n-max 3 outperforms w…
@JakeKAllDay · x.com · 2 days ago
Giving Qwen3.6:27b a shot with the model token prediction (MTP) support that just shipped on llama.cpp This should be a significant speed-up over what I've been using currently. My AMD Radeon 9700s…
@Aaronontheweb · x.com · 2 days ago
I nearly 2x'd the speed while only using +1GB VRAM with the new MTP update in llama.cpp 🤯 You need to add these flags to start using it: --spec-type draft-mtp \ --spec-draft-p-min 0.75 \…
@leftcurvedev_ · x.com · 2 days ago
Just got merged
@mudler_it · x.com · 2 days ago
The llama.cpp crowd are just testing now this PR! 😆 ITS MERGED!!!
@rumgewieselt · x.com · 2 days ago
Load more
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues
GitHub trending history
Shows when the repository has appeared on GitHub Trending across any language
all language ranking
c++ ranking
c ranking