Submit repository
Discover trends that matter
Daily explore
Topics
GitHub trending
Repositories
Developers
Repository engagements
Insights
Stats
0xSero/turboquant — GitHub trending stats & insights | Trendshift
Featured
Openhuman
0xSero/turboquant
#
NLP
#
AI infrastructure
TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration
Visit GitHub
Data last synced with GitHub about 17 hours ago
Python
1.3k
155
1 contributors
last commit about 1 month ago
last user commit about 1 month ago
GNU General Public License v3.0
created about 1 month ago
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
repository's daily and monthly activities across stars, forks, merged PRs, issues, and closed issues