Luce-Org/lucebox-hub — GitHub trending stats & insights | Trendshift
Luce-Org/lucebox-hub
#AI infrastructure, #Local LLM
Fast LLM speculative inference server for consumer hardware.
C++
2.6k
240
42 contributors
Apache License 2.0
Social mentions
Recent discussions about this repository across the web
Very proud to share that we just released Luce KVFlash. Run your preferred model inside Lucebox at 256k context, without thinking about KVCache and OOM, up to 2.9x faster decoding at long context.…
@davideciffa · x.com
The cuda12 image is a fat binary: nvcc emits device code for every architecture in the list, and the right kernels get picked at runtime. Nothing to detect, nothing to rebuild if you swap GPUs.
@pupposandro · x.com
A 26B model on a 24 GB laptop tied a 284B model on a 192 GB Mac Studio. Both 78.3% on ds4-eval-92, the eval ported from @antirez's ds4 (huge fans of all his work on it). To be honest DeepSeek V4…
@pupposandro · x.com
https://t.co/9NRKqov7Ur
@pupposandro · x.com
Scrapped 500+ issues and PRs to ship a massive @luceboxai repo redesign and fixes. Very proud of the team. The fastest inference server isn't going to come from a datacenter, it's going to run on the…
@pupposandro · x.com
2.5x faster than llama.cpp on Strix Halo. We just shipped DFlash + PFlash for the AMD Ryzen AI MAX+ 395 iGPU (gfx1151, 128 GiB unified memory). Qwen3.6-27B Q4_K_M, end-to-end on the same silicon: ▸…
@pupposandro · x.com
https://t.co/qdj5Re0SmU
@pupposandro · x.com
Lucebox: hand-tuned LLM inference for consumer hardware — rewritten kernel by kernel. - Megakernel: Qwen 3.5-0.8B at 1.87 tok/J on a 2020 GPU, matching Apple M5 - DFlash 27B: Qwen 3.5-27B at 130…
@so_sthbryan · x.com
No trending activity
This repository has not yet been featured on GitHub Trending
Repository activities
Daily and monthly repository activity across stars, forks, merged PRs, issues, and closed issues