Local LLM
DeepSeek 4 Flash local inference engine for Metal
Reduce your LLM costs by 40-70% automatically. Routes prompts locally, compacts context, tracks real savings.
Run OpenClaw Natively on Android — No Root, No Ubuntu, No proot
Reverse proxy for Claude Code that anonymizes sensitive pentest data (IPs, hashes, credentials, hostnames, PII) before it reaches Anthropic. Dual-layer detection: local Ollama LLM + regex safety net, with per-engagement vault and self-improving feedback loop.
Exact speculative decoding on Apple Silicon, powered by MLX.
Run LLMs on Apple devices with CoreML, optimized for Apple Neural Engine + GPU
Karpathy’s LLM Wiki, 100% local with Ollama. Drop Markdown notes → AI extracts concepts → your Obsidian wiki auto-links and grows. Zero sharing. Your notes stay yours.
Autonomous AI movie studio — turn a text prompt into a fully produced video. 100% local, no cloud, no API keys.
Gemma Gem runs Google's Gemma 4 model entirely on-device via WebGPU — no API keys, no cloud, no data leaving your machine.
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.
Community benchmark database for running LLMs on Apple Silicon Macs
Auto pilot for Claude Code - connect multiple coding agents to a local LLM brain. 🆕 with a hive mind now
Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware.
Atomic-Chat is an open source alternative to ChatGPT that runs 100% offline on your computer.
AI-powered penetration testing assistant using local LLM on linux (Parrot OS)
runs anywhere. uses anything
Atomic-Chat is an open source alternative to ChatGPT that runs 100% offline on your computer.
A real-world use case demo for multi-AI-agent with authorization and deterministic verification to secure money flows
Run Claude Code 100% on-device with local AI on Apple Silicon. MLX-native Anthropic-API server, 65 tok/s Qwen 3.5 122B, Llama 3.3 70B, Gemma 4 31B. Private, offline, airgap-ready. Built for NDA / legal / healthcare workflows.