Build, run, and manage agent platforms.
A local chatbot fine-tuned by bilibili user comments.
Open Source framework for voice and multimodal conversational AI
MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Focus on prompting and generating
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
SOTA Open Source TTS
Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.
llama3.np is a pure NumPy implementation for Llama 3 model.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
A collective list of free APIs
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Lumina-T2X is a unified framework for Text to Any Modality Generation
Open-source super AI assistant & Agent Harness. Plans tasks, runs tools and skills, self-evolves with memory and knowledge. Multi-model, multi-channel. Lightweight, extensible, one-line install. (formerly chatgpt-on-wechat)
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
Python scraper based on AI
Devon: An open-source pair programmer
NO TIME TO SLEEP
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
The agent engineering platform.