★ Reading this for free? Get 20 structured AI courses + per-chapter AI tutor — the first chapter of every course free, no card.Start free in 30 seconds
AI Models

Is XTTS v2 / Coqui TTS Free for Commercial Use? (2026)

June 20, 2026
8 min read
Local AI Master Research Team

Want to go deeper than this article?

Free account unlocks the first chapter of all 20 courses — RAG, agents, MCP, voice AI, MLOps, real GitHub repos.

📚AI Learning Path

Voice working locally? Build the whole pipeline. Whisper, TTS, and voice cloning wired into real projects — hands-on courses. First chapter free, no card.

Start free
Or own it for life — Lifetime $149, pay once

No. The XTTS v2 model weights are released under the Coqui Public Model License (CPML) 1.0.0, which explicitly permits only non-commercial use of the model and its audio output — and because Coqui Inc. shut down in January 2024, there is no longer anyone to sell you a commercial license. The confusing part is that the code (the Coqui TTS Python library) is MPL 2.0 and is fine for commercial use — it is the trained weights that are locked to non-commercial. If you need TTS or voice output in a paid product, SaaS, ad-supported video, or client work, the genuinely safe open picks are Kokoro (Apache-2.0), Chatterbox (MIT), and Piper (MIT).

This question comes up constantly because XTTS v2 sounds excellent and clones voices from a few seconds of audio, so people assume "open source on Hugging Face" means "free to ship." It does not. License and code license are two different things, and TTS is one of the worst categories for this trap. This guide separates the two cleanly, gives you a verified license table, and points you at models that actually let you make money.

Is XTTS v2 free for commercial use?

Short answer: no. The XTTS-v2 model card on Hugging Face ships a LICENSE.txt that is the Coqui Public Model License 1.0.0. The license defines what you may do as "non-commercial purposes" and states the restriction directly:

"Non-commercial purposes include any of the following uses of the model or its output, but only so far as you do not receive any direct or indirect payment arising from the use of the model or its output."

It goes further and blocks using the model to bootstrap a commercial product: "Use of the model to train other models for commercial use is not a non-commercial purpose." So you cannot use XTTS v2 to generate a dataset and then claim the downstream model is clean.

Back in 2023, Coqui did sell commercial XTTS licenses (there was a tier around 365 USD/year for companies under 1M USD in revenue or funding). That path is gone. Coqui the company shut down — the founders announced it in late 2023 and services went offline in January 2024. There is no sales team, no portal, and no one to issue or honor a commercial license today. Practically, that makes XTTS v2 non-commercial-only with no legal escape hatch. You can read the exact terms on the XTTS-v2 LICENSE.txt.

Reading articles is good. Building is better.

Free account = 20+ free chapters across 20 courses, with a per-chapter AI tutor. No card. Cancel anytime if you ever upgrade.

The code is MPL 2.0 — so why can't I use it commercially?

This is the single most common point of confusion, so it's worth being precise. There are two separate licenses in play:

  • The Coqui TTS library (the Python code) is licensed under MPL 2.0. MPL 2.0 is a permissive, business-friendly license — commercial use is fine. The code is still maintained by the community; an active maintained fork lives at idiap/coqui-ai-TTS (installable as the coqui-tts package).
  • The XTTS v2 weights (the trained model) are licensed under the Coqui Public Model License (CPML) — non-commercial only.

So "the code is open" is true and "the model is free for my product" is false, at the same time. You can ship a commercial app built on the MPL-2.0 TTS library, but you cannot ship it loaded with the CPML XTTS v2 weights. To use that library commercially you would pair it with weights that carry a permissive license — which is exactly what the table below is for. For a setup walkthrough of the model itself (non-commercial), see our Coqui TTS / XTTS-v2 local setup guide.

Local TTS license comparison (2026)

Here is the verified license status for the local TTS and voice-cloning models people actually run in 2026. "Commercial OK" means the model weights themselves carry a license that allows paid/commercial use — not just the code.

ModelWeights licenseCommercial use?Notes
XTTS v2 (Coqui)Coqui Public Model License (CPML)❌ NoCode is MPL 2.0; weights non-commercial. Coqui shut down Jan 2024 — no commercial license available
Kokoro-82MApache-2.0✅ YesTiny 82M model, ~327 MB, runs on CPU. v1.0 released Jan 27 2025
Chatterbox (Resemble AI)MIT✅ Yes0.5B model, released 2025, zero-shot cloning + emotion control, built-in watermark
Piper (Rhasspy / OHF-Voice)MIT (code)✅ Yes (check voice)Fast CPU/Raspberry Pi TTS. Code MIT; individual voice packs vary — verify each voice
F5-TTSCode MIT / weights CC-BY-NC-4.0❌ No (weights)Code is MIT, but the released checkpoints are non-commercial because of the Emilia training data
OpenVoice v2MIT✅ YesMyShell relicensed v2 to MIT (v1 was non-commercial); good for cloning + style transfer

Two entries deserve a flag because they trip people up. F5-TTS is frequently called "commercial-friendly" because its code is MIT — but the published weights are CC-BY-NC-4.0 (the maintainers say this is due to the Emilia in-the-wild dataset), so the official checkpoints are non-commercial. You would have to retrain on your own commercially-licensed data to ship F5-TTS in a product. And Piper is MIT for the engine, but the voices are a mix — most are permissive, but a few were trained on datasets with their own terms, so confirm the specific voice you ship.

What should I use instead of XTTS v2 for a commercial product?

Pick by what matters most for your use case:

  • Want the easiest clean license + low compute? Use Kokoro (Apache-2.0). It is an 82M-parameter model, the weights are about 327 MB, and it runs fast on a plain CPU. Apache-2.0 is about as permissive as it gets — ship it in a SaaS, an app, ad-supported videos, anything. The trade-off is no zero-shot voice cloning; you get a fixed set of high-quality voices (54 voices across 8 languages in v1.0). Full walkthrough in our Kokoro TTS local setup guide.
  • Want voice cloning under a clean license? Use Chatterbox (MIT). Resemble AI released it under MIT in 2025; it is a ~0.5B model with zero-shot cloning from a short reference clip plus emotion-exaggeration control, and it embeds a watermark on output. MIT means commercial use with no royalties or revenue share.
  • Want CPU-only, embedded, or Raspberry Pi narration? Use Piper (MIT). It is the lightest of the three and built for offline, low-power devices — just confirm your chosen voice pack's license before shipping.

If you specifically came for XTTS-style cloning, Chatterbox is the closest like-for-like replacement that you can actually monetize. For a broader hands-on comparison of cloning models, see our local AI voice cloning roundup and the dedicated F5-TTS setup guide (great for personal/research use, just not commercial off the shelf).

Reading articles is good. Building is better.

Free account = 20+ free chapters across 20 courses, with a per-chapter AI tutor. No card. Cancel anytime if you ever upgrade.

First-hand note: what running these locally feels like

We have run all four families on a single mid-range box (RTX 3060 12GB plus a recent desktop CPU), and the practical picture is approximate but consistent. Kokoro is the one that genuinely doesn't need a GPU — on CPU it generates a sentence faster than real time in our informal testing, which is why its Apache-2.0 license plus tiny footprint makes it the default commercial recommendation. XTTS v2 and Chatterbox both want a GPU to feel responsive; on the 3060 they produce a few seconds of cloned speech in roughly a second or two per short utterance (rough, hardware-dependent, not a controlled benchmark). The takeaway isn't the exact numbers — it's that license, not raw quality, is the deciding factor for a paid product, and the permissive models are good enough that you are not sacrificing much to stay legal.

Key Takeaways

  1. XTTS v2 is not free for commercial use. Its weights are under the Coqui Public Model License (non-commercial), and Coqui shut down in January 2024, so no commercial license can be bought.
  2. Code license is not model license. The Coqui TTS library is MPL 2.0 (commercial OK), but the XTTS v2 weights are CPML (non-commercial). Both are true at once.
  3. F5-TTS has the same split. MIT code, but CC-BY-NC-4.0 weights — the official checkpoints are non-commercial.
  4. The safe commercial picks are Kokoro (Apache-2.0), Chatterbox (MIT), and Piper (MIT). Kokoro for the cleanest license and CPU use, Chatterbox for cloning, Piper for embedded/low-power.
  5. Always verify the weights license on the model card, and for Piper verify the individual voice pack — open source on Hugging Face does not mean commercial-free.

Next Steps

🎯
AI Learning Path

Voice working locally? Build the whole pipeline.

Whisper, TTS, and voice cloning wired into real projects — hands-on courses. First chapter free, no card.

Or own it for life — Lifetime $149 $599, pay once

Liked this? 20 full AI courses are waiting.

From fundamentals to RAG, agents, MCP servers, voice AI, and production deployment with real GitHub repos. First chapter free, every course.

Reading now
Join the discussion

Local AI Master Research Team

Creator of Local AI Master. I've built datasets with over 77,000 examples and trained AI models from scratch. Now I help people achieve AI independence through local AI mastery.

Build Real AI on Your Machine

RAG, agents, NLP, vision, and MLOps - chapters across 20 courses that take you from reading about AI to building AI.

Want structured AI education?

20 courses, 495+ chapters, from $9. Understand AI, don't just use it.

AI Learning Path
More on Local Voice & Speech
See the full Coqui TTS & Local Voice AI guide.

Comments (0)

No comments yet. Be the first to share your thoughts!

📅 Published: June 20, 2026🔄 Last Updated: June 20, 2026✓ Manually Reviewed

Ready to Go Beyond Tutorials?

20 structured courses with hands-on chapters - build RAG chatbots, AI agents, and ML pipelines on your own hardware.

🎯
AI Learning Path

Go from reading about AI to building with AI

20 structured courses. Hands-on projects. Runs on your machine. Start free.

Or own it for life — Lifetime $149 $599, pay once

Was this helpful?

LM

Written by the Local AI Master Team

The team behind Local AI Master

We build Local AI Master around practical, testable local AI workflows: model selection, hardware planning, RAG systems, agents, and MLOps. The goal is to turn scattered tutorials into a structured learning path you can follow on your own hardware.

✓ Local AI Curriculum✓ Hands-On Projects✓ Open Source Contributor
📚
Free · no account required

Grab the AI Starter Kit — career roadmap, cheat sheet, setup guide

No spam. Unsubscribe with one click.

🎯
AI Learning Path

Voice working locally? Build the whole pipeline.

Whisper, TTS, and voice cloning wired into real projects — hands-on courses. First chapter free, no card.

Or own it for life — Lifetime $149 $599, pay once
Free Tools & Calculators