Show HN: Real-time local TTS (31M params, 5.6x CPU, voice cloning, ONNX) https://ift.tt/EkfXjyM

Show HN: Real-time local TTS (31M params, 5.6x CPU, voice cloning, ONNX) Hi guys and gals, I made a TTS model based on my highly upgraded VITS base, conditioned on external speaker embeddings (Resemble AI's Resemblyzer). The model, with ~31M parameters (ONNX), is tuned for latency and local inference, and comes already exported. I was trying to push the limits of what I could do with small, fast models. Runs 5.6x realtime on a server CPU It supports voice cloning, voice blending (mix two or more speakers to make a new voice), the license is Apache 2.0 and it uses DeepPhonemizer (MIT) for the phonemization, so no license issues. The repo contains the checkpoint, how to run it, and links to Colab and HuggingFace demos. Now, because it's tiny, audio quality isn't the best, and as it was trained on LibriTTS-R + VCTK (both fully open datasets), speaker similarity isn't as good. Regardless, I hope it's useful. https://ift.tt/IeVaywS March 18, 2026 at 11:48PM

हमरु उत्तराखण्ड

Show HN: Real-time local TTS (31M params, 5.6x CPU, voice cloning, ONNX) https://ift.tt/EkfXjyM

Post a Comment

0 Comments

Popular Posts

भरत नाट्य शास्त्र गढवाली अनुवाद

Show HN: I made a Telegram bot to get Raspberry Pi “in-stock” notification https://ift.tt/GtsFfAl

Show HN: Stratup.ai – Startup Idea Machine https://ift.tt/7RfCINq

Subscribe Us

Technology

Comments

Facebook

Categories

Menu Footer Widget

हमरु उत्तराखण्ड

Show HN: Real-time local TTS (31M params, 5.6x CPU, voice cloning, ONNX) https://ift.tt/EkfXjyM

You may like these posts

Post a Comment

0 Comments

Social Plugin

Popular Posts

भरत नाट्य शास्त्र गढवाली अनुवाद

Show HN: I made a Telegram bot to get Raspberry Pi “in-stock” notification https://ift.tt/GtsFfAl

Show HN: Stratup.ai – Startup Idea Machine https://ift.tt/7RfCINq

Subscribe Us

Technology

Comments

Facebook

Categories

Menu Footer Widget