base on AI-Powered Watermark Remover using Florence-2 and LaMA: Remove watermarks from images and videos, including AI-generated content from Sora, Runway, and others. Features a modern PyWebview GUI. # WatermarkRemover-AI **AI-Powered Watermark Removal Tool using Florence-2 and LaMA Models** 🇬🇧 English | 🇫🇷 Français | 🇨🇳 中文 | 🇯🇵 日本語 | 🇧🇷 Português | 🧠 Brainrot [![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT) --- ## Overview `WatermarkRemover-AI` is a cutting-edge application that leverages AI models for precise watermark detection and seamless removal. Perfect for removing watermarks from AI-generated videos like Sora, Sora 2, Runway, and others. It uses Florence-2 from Microsoft for watermark identification and LaMA for inpainting to fill in the removed regions naturally. The software features a modern GUI built with PyWebview for an accessible and intuitive experience. ## Screenshot ![App Screenshot](assets/screenshot-preview.png) ## Demo https://github.com/user-attachments/assets/505be2a8-8eda-4def-90b6-5a4ceefee456 --- ## Features - **Smart Detection** - AI-powered watermark detection using Florence-2 - **Seamless Removal** - LaMA inpainting for natural-looking results - **Video Support** - Process videos with two-pass detection and audio preservation - **AI Video Ready** - Remove watermarks from Sora, Sora 2, Runway, and other AI-generated videos - **Batch Processing** - Handle entire folders at once - **Preview Mode** - Preview detected watermarks before processing - **Fade In/Out Handling** - Extend masks for watermarks that fade in/out - **GPU Acceleration** - CUDA support for faster processing - **Multi-Language UI** - Available in English, French, Chinese, Japanese, Portuguese, and more - **Themes** - Multiple UI themes to choose from --- ## Installation ### Windows The setup script downloads a portable Python environment automatically - no system Python required. ```powershell git clone https://github.com/D-Ogi/WatermarkRemover-AI.git cd WatermarkRemover-AI .\setup.ps1 ``` After setup, double-click `run.bat` to launch the app. ### Linux / macOS Requires Python 3.10+ installed on your system. ```bash git clone https://github.com/D-Ogi/WatermarkRemover-AI.git cd WatermarkRemover-AI chmod +x setup.sh ./setup.sh ``` After setup, run `./run.sh` to launch the app. ### Optional: FFmpeg Install FFmpeg to preserve audio when processing videos: - **Windows**: Download from [ffmpeg.org](https://ffmpeg.org/download.html) and add to PATH - **Linux**: `sudo apt install ffmpeg` - **macOS**: `brew install ffmpeg` --- ## Usage ### GUI Mode 1. Run the app (`run.bat` on Windows, `./run.sh` on macOS/Linux) 2. Select your preferred language and theme from the top-right corner 3. Select your mode (Single File or Batch) 4. Set input and output paths 5. Configure settings as needed 6. Hit **Start Processing** Your settings are automatically saved and restored on next launch. ### CLI Mode ```bash # Basic usage python remwm.py input.png output_folder/ # With options python remwm.py ./images ./output --overwrite --max-bbox-percent=15 --force-format=PNG # Process video with two-pass detection python remwm.py video.mp4 ./output --detection-skip=3 --fade-in=0.5 --fade-out=0.5 # Preview mode (detect without processing) python remwm.py input.png --preview ``` ### CLI Options | Option | Description | |--------|-------------| | `--overwrite` | Overwrite existing files | | `--transparent` | Make watermark regions transparent (images only) | | `--max-bbox-percent` | Max detection size as % of image (default: 10) | | `--force-format` | Force output format (PNG, WEBP, JPG, MP4, AVI) | | `--detection-prompt` | Custom detection prompt (default: "watermark") | | `--detection-skip` | Detect every N frames for videos (1-10, default: 1) | | `--fade-in` | Extend mask backwards by N seconds (for fade-in watermarks) | | `--fade-out` | Extend mask forwards by N seconds (for fade-out watermarks) | | `--preview` | Preview detected watermarks without processing | --- ## Video Processing - **Supported formats:** MP4, AVI, MOV, MKV, FLV, WMV, WEBM - **Audio preservation:** Requires FFmpeg installed - **Two-pass mode:** Faster processing with `--detection-skip` > 1 - **Fade handling:** Use `--fade-in` / `--fade-out` for watermarks that appear/disappear gradually --- ## Tech Stack - **Florence-2** - Microsoft's vision model for watermark detection - **LaMA** - Large Mask Inpainting model - **PyWebview** - Cross-platform webview wrapper - **Alpine.js** - Lightweight JavaScript framework for UI - **PyTorch** - Deep learning backend --- ## Contributing Contributions are welcome! Feel free to: 1. Fork the repository 2. Create a feature branch 3. Submit a pull request --- ## License This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details. --- ## Star History [![Star History Chart](https://api.star-history.com/svg?repos=D-Ogi/WatermarkRemover-AI&type=date&legend=top-left)](https://www.star-history.com/#D-Ogi/WatermarkRemover-AI&type=date&legend=top-left) ", Assign "at most 3 tags" to the expected json: {"id":"13555","tags":[]} "only from the tags list I provide: [{"id":39,"name":"3d-generation","display_name":"3D generation","slug":"3d-generation"},{"id":3,"name":"ai-agent","display_name":"AI agent","slug":"ai-agent"},{"id":8,"name":"ai-coding","display_name":"AI coding assistant","slug":"ai-coding"},{"id":5,"name":"ai-image","display_name":"AI image generation","slug":"ai-image"},{"id":9,"name":"ai-infrastructure","display_name":"AI infrastructure","slug":"ai-infrastructure"},{"id":10,"name":"ai-memory","display_name":"AI memory","slug":"ai-memory"},{"id":11,"name":"ai-skills","display_name":"AI skills","slug":"ai-skills"},{"id":12,"name":"ai-translation","display_name":"AI translation","slug":"ai-translation"},{"id":6,"name":"ai-video","display_name":"AI video generation","slug":"ai-video"},{"id":4,"name":"ai-voice","display_name":"AI voice","slug":"ai-voice"},{"id":7,"name":"ai-workflow","display_name":"AI workflow","slug":"ai-workflow"},{"id":22,"name":"audio-processing","display_name":"Audio processing","slug":"audio-processing"},{"id":29,"name":"authentication","display_name":"Authentication","slug":"authentication"},{"id":51,"name":"bundler","display_name":"Bundler","slug":"bundler"},{"id":41,"name":"chatbot","display_name":"Chatbot","slug":"chatbot"},{"id":27,"name":"cloud-native","display_name":"Cloud native","slug":"cloud-native"},{"id":1,"name":"computer-vision","display_name":"Computer vision","slug":"computer-vision"},{"id":37,"name":"crypto-trading","display_name":"Crypto trading","slug":"crypto-trading"},{"id":57,"name":"curated-list","display_name":"Curated list","slug":"curated-list"},{"id":54,"name":"data-streaming","display_name":"Data streaming","slug":"data-streaming"},{"id":35,"name":"data-visualization","display_name":"Data visualization","slug":"data-visualization"},{"id":16,"name":"database-backup","display_name":"Database backup","slug":"database-backup"},{"id":49,"name":"design-system","display_name":"Design system","slug":"design-system"},{"id":38,"name":"digital-human","display_name":"Digital human","slug":"digital-human"},{"id":34,"name":"document-processing","display_name":"Document processing","slug":"document-processing"},{"id":44,"name":"ecommerce","display_name":"E-commerce","slug":"ecommerce"},{"id":45,"name":"emulator","display_name":"Emulator","slug":"emulator"},{"id":46,"name":"file-management","display_name":"File management","slug":"file-management"},{"id":32,"name":"fintech","display_name":"Fintech","slug":"fintech"},{"id":31,"name":"game-development","display_name":"Game development","slug":"game-development"},{"id":24,"name":"headless-browser","display_name":"Headless browser","slug":"headless-browser"},{"id":52,"name":"headless-cms","display_name":"Headless CMS","slug":"headless-cms"},{"id":36,"name":"home-automation","display_name":"Home automation","slug":"home-automation"},{"id":20,"name":"image-editing","display_name":"Image editing","slug":"image-editing"},{"id":28,"name":"iot","display_name":"IoT","slug":"iot"},{"id":13,"name":"local-llm","display_name":"Local LLM","slug":"local-llm"},{"id":17,"name":"mcp","display_name":"MCP","slug":"mcp"},{"id":47,"name":"monitoring","display_name":"Monitoring","slug":"monitoring"},{"id":2,"name":"nlp","display_name":"NLP","slug":"nlp"},{"id":26,"name":"observability","display_name":"Observability","slug":"observability"},{"id":40,"name":"pentesting","display_name":"Pentesting","slug":"pentesting"},{"id":48,"name":"programming-examples","display_name":"Programming examples","slug":"programming-examples"},{"id":42,"name":"proxy","display_name":"Proxy","slug":"proxy"},{"id":14,"name":"rag","display_name":"RAG","slug":"rag"},{"id":56,"name":"resume-building","display_name":"Resume building","slug":"resume-building"},{"id":33,"name":"robotics","display_name":"Robotics","slug":"robotics"},{"id":30,"name":"search","display_name":"Search","slug":"search"},{"id":43,"name":"self-hosted","display_name":"Self-hosted","slug":"self-hosted"},{"id":50,"name":"static-analysis","display_name":"Static analysis","slug":"static-analysis"},{"id":18,"name":"synthetic-data","display_name":"Synthetic data","slug":"synthetic-data"},{"id":19,"name":"text-to-speech","display_name":"Text to speech","slug":"text-to-speech"},{"id":53,"name":"ui-components","display_name":"UI components","slug":"ui-components"},{"id":15,"name":"vector-database","display_name":"Vector database","slug":"vector-database"},{"id":21,"name":"video-editing","display_name":"Video editing","slug":"video-editing"},{"id":25,"name":"web-scraping","display_name":"Web scraping","slug":"web-scraping"},{"id":55,"name":"webassembly","display_name":"WebAssembly","slug":"webassembly"},{"id":23,"name":"workflow-automation","display_name":"Workflow automation","slug":"workflow-automation"}]" returns me the "expected json"