Qwen TTS focuses on on-device processing with no external API; emotion control relies on precise prompts, shaping output ...
KittenTTS brings small text to speech models to edge devices; the Nano 8-bit model is about 25 MB, local playback is possible.