This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
TripoSR: Fast 3D Object Reconstruction from a Single Image
DUSt3R: Geometric 3D Vision Made Easy
Open-Sora: Democratizing Efficient Video Production for All
[WIP] Layer Diffusion for WebUI (via Forge)
tiny vision language model
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
Stable Diffusion web UI
Layer Diffuse custom nodes
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
The agent engineering platform.
📚 Freely available programming books
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
An opinionated list of Python frameworks, libraries, tools, and resources
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
PyTorch code and models for V-JEPA self-supervised learning from video.