1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
OCR, layout analysis, reading order, table recognition in 90+ languages
Question and Answer based on Anything.
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
Home Assistant integration for Haier hOn: support for Haier/Candy/Hoover home appliances like washing machines and air conditioners in 28 languages.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
📚 Freely available programming books
Turns Data and AI algorithms into production-ready web applications in no time.
Character Animation (AnimateAnyone, Face Reenactment)
A programming framework for agentic AI
🏛️ Diagram as Code for prototyping cloud system architectures
SGLang is a high-performance serving framework for large language models and multimodal models.
A lightweight coding agent for open models like Deepseek, Kimi, and Qwen
An opinionated list of Python frameworks, libraries, tools, and resources
The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
We write your reusable computer vision tools. 💜
Stable Diffusion web UI
vits2 backbone with multilingual-bert
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.