A generative speech model for daily dialogue.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
OCR, layout analysis, reading order, table recognition in 90+ languages
A curated list of practical financial machine learning tools and applications.
Convert PDF to markdown + JSON quickly with high accuracy
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
📚 Freely available programming books
Windrecorder is a memory search app by records everything on your screen in small size, to let you rewind what you have seen, query through OCR text or image description, and get activity statistics, like Microsoft's Windows Recall or Rewind.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Stable Diffusion web UI
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
turnkey self-hosted offline transcription and diarization service with llm summary
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.