A lightweight coding agent for open models like Deepseek, Kimi, and Qwen
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
The first real AI developer
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
OCR, layout analysis, reading order, table recognition in 90+ languages
Instant voice cloning by MIT and MyShell. Audio foundation model.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
An opinionated list of Python frameworks, libraries, tools, and resources
Stable Diffusion web UI
The agent engineering platform.
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
A collective list of free APIs
Focus on prompting and generating
Character Animation (AnimateAnyone, Face Reenactment)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Best Practices on Recommendation Systems
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Making large AI models cheaper, faster and more accessible
📚 Freely available programming books
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.