Document processing

A fast, modular Flask backend API featuring language translation, web scraping, PDF text extraction, YouTube transcript fetching, and LLM-powered chat, summarization, and essay generation via Together AI.

#AI translation #Web scraping #Document processing #Chatbot

x4000/MDManager

New 2026

A fast, portable desktop viewer (and editor) for trees of markdown documents.

#Document processing

ares0027/storycrafter_lite

New 2026

#Local LLM #Document processing

barnetwang/document_structuring

231

New 2026

This skill parses PDF and Word (`.docx`) documents, slices them into structured Markdown sections based on document headings, stores the segments in a local SQLite database, and supports full-text search, TOC viewing, chunk retrieval, and cascading deletion.

#Search #Document processing

GerardoBarrera/pdfmakerapi-mcp

New 2026

MCP server for PDFMakerAPI — turn a description into a shareable, editable PDF (invoices, certificates, reports, resumes) from Claude, Cursor, and other MCP clients.

#MCP #Document processing

cfitzgerald-pd/1pager

New 2026

AI is too verbose. 1pager condenses any convo/thought/workspace into a single 1-pager optimized for human review.

#NLP #Document processing

DENSLnetion/Binot

New 2026

A smart voice note-taking app powered by Google Gemini AI. Transcribe, summarize, and organize audio into structured Markdown notes.

#NLP #AI voice #Document processing

seanbaugh/obsidian-to-supernote

New 2026

Send Obsidian files to Supernote sync folder of your choice as a PDF

#Document processing

tonycletus/docshift

New 2026

Free, local-first PDF tools that run in the browser with no uploads, database, or external API.

#Document processing #Self-hosted

ColCh/paperless-ngx-skill

New 2026

Reusable CLI + agent skill for the Paperless-NGX REST API via openapi-to-cli, with urllib helpers for multipart upload and array-field PATCH. Fully env-driven, nothing hardcoded.

#AI skills #Document processing

Kulturban/supernote-obsidian-sync

New 2026

Sync Supernote handwritten notes to Obsidian with Mistral OCR

#Document processing

tamnd/yomi

New 2026

Read any web page, or a whole website, into clean Markdown

#Web scraping #Document processing

VishiATChoudhary/open-prism

New 2026

Open-source AI LaTeX compiler + PDF viewer — desktop clone of ChatGPT Prism

#Document processing #Self-hosted

cherifon/Privacy-PDF-Editor

New 2026

Self-hosted PDF editor — merge, anonymize & watermark PDFs locally · FastAPI + PDF.js + Docker

#Document processing #Self-hosted

andyhuo520/ppocrv6-studio

8712

New 2026

# 简短要点总结 1. 基于PP-OCRv6（Tiny/Small/Medium三版本）搭建本地OCR工作台 2. 苹果硅芯片支持CoreML硬件加速 3. 提供模型一键切换功能 4. 配套OmniDocBench评测能力

#Computer vision #Document processing

dave8172/doceval

New 2026

Eval harness for document extraction pipelines — field-level accuracy, failure taxonomy, cost tracking

#Document processing

BrandonDry/Omnivert

New 2026

Convert anything to Markdown — a Windows desktop GUI for Microsoft MarkItDown. Turn files, folders, URLs, and text into clean Markdown, no terminal required.

#Document processing

AI agent16.1k starsAI coding assistant6.7k starsAI skills5.9k starsSelf-hosted3.3k starsMCP2.9k starsAI infrastructure2.9k starsAI video generation2.8k starsCurated list2.6k starsAI workflow2.5k starsWorkflow automation2.3k stars

Document processing

Other topics