Text to speech
Open Source Speech Language Model
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
Let your 🦞 bot shout and speak with a "human" vibe
Give OpenClaw a voice — Let your agent speak from any Mac on your network
YumCut - free AI video generator to turn a prompt into ready vertical videos for TikTok, Reels and YouTube Shorts. Auto script, scenes, voiceover, subtitles and watermark. Built with Next.js. Local-first pipeline + templates, batch rendering and API hooks for creators and indie makers. Self-hosted, FFmpeg-ready, multi-language output. Low cost fast
Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
Official inference code for SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis
Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX.
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support
Awesome-Arabic-AI is a curated, professional-grade repository designed to be the central hub for the best open-source Arabic AI resources.
The open-source voice synthesis studio
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
A simple ComfyUI implementation of Qwen3-TTS
Official code for "Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis"
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
Soprano-Factory: Train your own 2000x realtime text-to-speech model
Open-source text-to-speech for European languages with voice cloning
A TTS that fits in your CPU (and pocket)
Standalone desktop application for text-to-speech (TTS) on PDF files, using the Kokoro-82M AI model
Real-time YouTube Live Chat Text-to-Speech (TTS) using ElevenLabs AI voices