Local LLM
Exact speculative decoding on Apple Silicon, powered by MLX.
Run LLMs on Apple devices with CoreML, optimized for Apple Neural Engine + GPU
Autonomous AI movie studio — turn a text prompt into a fully produced video. 100% local, no cloud, no API keys.
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.
Community benchmark database for running LLMs on Apple Silicon Macs
Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware.
Atomic-Chat is an open source alternative to ChatGPT that runs 100% offline on your computer.
AI-powered penetration testing assistant using a local LLM on Linux (Parrot OS)
Open Claude is an open-source coding-agent CLI for OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
A real-world use-case demo of a multi-agent AI system with authorization and deterministic verification to secure money flows
Run Claude Code 100% on-device with local AI on Apple Silicon. MLX-native Anthropic-API server, 65 tok/s Qwen 3.5 122B, Llama 3.3 70B, Gemma 4 31B. Private, offline, airgap-ready. Built for NDA / legal / healthcare workflows.
The free AI already on your Mac. CLI tool, OpenAI-compatible server, and interactive chat — all on-device via Apple Intelligence. No API keys, no cloud, no downloads.
mac code — Claude Code, but it runs on your Mac for free. 35B AI agent at 30 tok/s via Apple Silicon flash-paging. $0/month.
⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, macOS + iOS (iPhone) app.
A self-hosted development service multitool intended for personal use.
100% private on-device voice models for speech-to-text and meeting transcription on macOS