InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
A Pythonic framework to simplify AI service building
Inference code for CodeLlama models
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
The open source platform for AI-native application development.
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
🚂 12306 ticket-booking assistant, supporting clusters, multiple accounts, multi-task ticket purchasing, and web page management
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
tiny vision language model
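Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Built-in multi-language libraries.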
Stable Diffusion web UI
DSPy: The framework for programming—not prompting—language models
Apprise - Push Notifications that work with just about every platform!
Real-time face swap for PC streaming or video calls
DeepSeek Coder: Let the Code Write Itself
Mobile-Agent: The Powerful GUI Agent Family
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Python-based web automation tool. Powerful and elegant.
An opinionated list of Python frameworks, libraries, tools, and resources
A collective list of free APIs