1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Fast and accurate AI-powered file content type detection
Large World Model -- Modeling Text and Video with Million-Length Context
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
UFO³: Weaving the Digital Agent Galaxy
Machine Learning Engineering Open Book
Open Source AI Platform - AI Chat with advanced features that works with every LLM
OCR, layout analysis, reading order, table recognition in 90+ languages
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Unified framework for building enterprise RAG pipelines with small, specialized models
A terminal application to view, tail, merge, and search log files (plus JSONL).
Stable Diffusion web UI
🙌 Welcome open-source Python mini-project contributions!
A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Hunt down social media accounts by username across social networks
An opinionated list of Python frameworks, libraries, tools, and resources
PyTorch code and models for V-JEPA self-supervised learning from video.
Build, run, and manage agent platforms.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.