A collection of learning resources for curious software engineers
Automate Creation of YouTube Shorts using MoviePy.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Fast and accurate AI powered file content types detection
Large World Model -- Modeling Text and Video with Millions Context
该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.
A collective list of free APIs
Hunt down social media accounts by username across social networks
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The official PyTorch implementation of Google's Gemma models
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Build, run, and manage agent platforms.
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Stable Diffusion web UI
Foundational model for human-like, expressive TTS
Open Source AI Platform - AI Chat with advanced features that works with every LLM
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.