Python tool for converting files and office documents to Markdown.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
🕵️♂️ Collect a dossier on a person by username from 3000+ sites
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Get your documents ready for gen AI
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Open-source framework for conversational voice AI agents
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Automate Creation of YouTube Shorts using MoviePy.
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
24 channel, 100Msps logic analyzer hardware and software
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claude to generate and manage its own tools, continuously expanding its capabilities through conversation. Available both as a CLI and a modern web interface
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
SOTA Open Source TTS
A feature-rich command-line audio/video downloader
An opinionated list of Python frameworks, libraries, tools, and resources
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Give AI agents the context to query business data correctly through the open context layer that gives AI agents grounded, governed memory, context, SQL across 20+ data sources, that helps you build agentic GenBI, text-to-sql, dashboards, and agentic analytics.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Simple, unified interface to multiple Generative AI providers