No. The XTTS v2 model weights are released under the Coqui Public Model License (CPML) 1.0.0, which explicitly permits only non-commercial use of the model and its audio output — and because Coqui Inc. shut down in January 2024, there is no longer anyone to sell you a commercial license. The confusing part is that the code (the Coqui TTS Python library) is MPL 2.0 and is fine for commercial use — it is the trained weights that are locked to non-commercial. If you need TTS or voice output in a paid product, SaaS, ad-supported video, or client work, the genuinely safe open picks are Kokoro (Apache-2.0), Chatterbox (MIT), and Piper (MIT).

This question comes up constantly because XTTS v2 sounds excellent and clones voices from a few seconds of audio, so people assume "open source on Hugging Face" means "free to ship." It does not. License and code license are two different things, and TTS is one of the worst categories for this trap. This guide separates the two cleanly, gives you a verified license table, and points you at models that actually let you make money.

Is XTTS v2 free for commercial use?

Short answer: no. The XTTS-v2 model card on Hugging Face ships a LICENSE.txt that is the Coqui Public Model License 1.0.0. The license defines what you may do as "non-commercial purposes" and states the restriction directly:

"Non-commercial purposes include any of the following uses of the model or its output, but only so far as you do not receive any direct or indirect payment arising from the use of the model or its output."

It goes further and blocks using the model to bootstrap a commercial product: "Use of the model to train other models for commercial use is not a non-commercial purpose." So you cannot use XTTS v2 to generate a dataset and then claim the downstream model is clean.

Back in 2023, Coqui did sell commercial XTTS licenses (there was a tier around 365 USD/year for companies under 1M USD in revenue or funding). That path is gone. Coqui the company shut down — the founders announced it in late 2023 and services went offline in January 2024. There is no sales team, no portal, and no one to issue or honor a commercial license today. Practically, that makes XTTS v2 non-commercial-only with no legal escape hatch. You can read the exact terms on the XTTS-v2 LICENSE.txt.

Reading articles is good. Building is better.

Free account = 20+ free chapters across 20 courses, with a per-chapter AI tutor. No card. Cancel anytime if you ever upgrade.

Start free in 30 seconds See pricing

The code is MPL 2.0 — so why can't I use it commercially?

This is the single most common point of confusion, so it's worth being precise. There are two separate licenses in play:

The Coqui TTS library (the Python code) is licensed under MPL 2.0. MPL 2.0 is a permissive, business-friendly license — commercial use is fine. The code is still maintained by the community; an active maintained fork lives at idiap/coqui-ai-TTS (installable as the coqui-tts package).
The XTTS v2 weights (the trained model) are licensed under the Coqui Public Model License (CPML) — non-commercial only.

So "the code is open" is true and "the model is free for my product" is false, at the same time. You can ship a commercial app built on the MPL-2.0 TTS library, but you cannot ship it loaded with the CPML XTTS v2 weights. To use that library commercially you would pair it with weights that carry a permissive license — which is exactly what the table below is for. For a setup walkthrough of the model itself (non-commercial), see our Coqui TTS / XTTS-v2 local setup guide.

Local TTS license comparison (2026)

Here is the verified license status for the local TTS and voice-cloning models people actually run in 2026. "Commercial OK" means the model weights themselves carry a license that allows paid/commercial use — not just the code.

Model	Weights license	Commercial use?	Notes
XTTS v2 (Coqui)	Coqui Public Model License (CPML)	❌ No	Code is MPL 2.0; weights non-commercial. Coqui shut down Jan 2024 — no commercial license available
Kokoro-82M	Apache-2.0	✅ Yes	Tiny 82M model, ~327 MB, runs on CPU. v1.0 released Jan 27 2025
Chatterbox (Resemble AI)	MIT	✅ Yes	0.5B model, released 2025, zero-shot cloning + emotion control, built-in watermark
Piper (Rhasspy / OHF-Voice)	MIT (code)	✅ Yes (check voice)	Fast CPU/Raspberry Pi TTS. Code MIT; individual voice packs vary — verify each voice
F5-TTS	Code MIT / weights CC-BY-NC-4.0	❌ No (weights)	Code is MIT, but the released checkpoints are non-commercial because of the Emilia training data
OpenVoice v2	MIT	✅ Yes	MyShell relicensed v2 to MIT (v1 was non-commercial); good for cloning + style transfer

Two entries deserve a flag because they trip people up. F5-TTS is frequently called "commercial-friendly" because its code is MIT — but the published weights are CC-BY-NC-4.0 (the maintainers say this is due to the Emilia in-the-wild dataset), so the official checkpoints are non-commercial. You would have to retrain on your own commercially-licensed data to ship F5-TTS in a product. And Piper is MIT for the engine, but the voices are a mix — most are permissive, but a few were trained on datasets with their own terms, so confirm the specific voice you ship.

What should I use instead of XTTS v2 for a commercial product?

Pick by what matters most for your use case:

Want the easiest clean license + low compute? Use Kokoro (Apache-2.0). It is an 82M-parameter model, the weights are about 327 MB, and it runs fast on a plain CPU. Apache-2.0 is about as permissive as it gets — ship it in a SaaS, an app, ad-supported videos, anything. The trade-off is no zero-shot voice cloning; you get a fixed set of high-quality voices (54 voices across 8 languages in v1.0). Full walkthrough in our Kokoro TTS local setup guide.
Want voice cloning under a clean license? Use Chatterbox (MIT). Resemble AI released it under MIT in 2025; it is a ~0.5B model with zero-shot cloning from a short reference clip plus emotion-exaggeration control, and it embeds a watermark on output. MIT means commercial use with no royalties or revenue share.
Want CPU-only, embedded, or Raspberry Pi narration? Use Piper (MIT). It is the lightest of the three and built for offline, low-power devices — just confirm your chosen voice pack's license before shipping.

If you specifically came for XTTS-style cloning, Chatterbox is the closest like-for-like replacement that you can actually monetize. For a broader hands-on comparison of cloning models, see our local AI voice cloning roundup and the dedicated F5-TTS setup guide (great for personal/research use, just not commercial off the shelf).

Reading articles is good. Building is better.

Free account = 20+ free chapters across 20 courses, with a per-chapter AI tutor. No card. Cancel anytime if you ever upgrade.

Start free in 30 seconds See pricing

First-hand note: what running these locally feels like

We have run all four families on a single mid-range box (RTX 3060 12GB plus a recent desktop CPU), and the practical picture is approximate but consistent. Kokoro is the one that genuinely doesn't need a GPU — on CPU it generates a sentence faster than real time in our informal testing, which is why its Apache-2.0 license plus tiny footprint makes it the default commercial recommendation. XTTS v2 and Chatterbox both want a GPU to feel responsive; on the 3060 they produce a few seconds of cloned speech in roughly a second or two per short utterance (rough, hardware-dependent, not a controlled benchmark). The takeaway isn't the exact numbers — it's that license, not raw quality, is the deciding factor for a paid product, and the permissive models are good enough that you are not sacrificing much to stay legal.

Key Takeaways

XTTS v2 is not free for commercial use. Its weights are under the Coqui Public Model License (non-commercial), and Coqui shut down in January 2024, so no commercial license can be bought.
Code license is not model license. The Coqui TTS library is MPL 2.0 (commercial OK), but the XTTS v2 weights are CPML (non-commercial). Both are true at once.
F5-TTS has the same split. MIT code, but CC-BY-NC-4.0 weights — the official checkpoints are non-commercial.
The safe commercial picks are Kokoro (Apache-2.0), Chatterbox (MIT), and Piper (MIT). Kokoro for the cleanest license and CPU use, Chatterbox for cloning, Piper for embedded/low-power.
Always verify the weights license on the model card, and for Piper verify the individual voice pack — open source on Hugging Face does not mean commercial-free.

Next Steps

Need cloning you can sell? Read the local AI voice cloning comparison to see Chatterbox and OpenVoice v2 side by side.
Want the no-GPU, Apache-2.0 default? Follow the Kokoro TTS local setup guide.
Still want to try XTTS v2 for personal/research use? Our Coqui TTS setup guide covers install and voice cloning (non-commercial only).
Comparing the newest cloning model? See the F5-TTS setup guide — excellent quality, but check the weights license before any paid use.

Is XTTS v2 / Coqui TTS Free for Commercial Use? (2026)

Want to go deeper than this article?

Is XTTS v2 free for commercial use?

Reading articles is good. Building is better.

The code is MPL 2.0 — so why can't I use it commercially?

Local TTS license comparison (2026)

What should I use instead of XTTS v2 for a commercial product?

Reading articles is good. Building is better.

First-hand note: what running these locally feels like

Key Takeaways

Next Steps

Voice working locally? Build the whole pipeline.

Liked this? 20 full AI courses are waiting.

Local AI Master Research Team

Build Real AI on Your Machine

Want structured AI education?

Continue Your Local AI Journey

How to Install Your First Local AI Model

How to Choose the Right AI Model for Your Computer

Comments (0)

Ready to Go Beyond Tutorials?

Go from reading about AI to building with AI

Related Guides

Coqui TTS (XTTS-v2) Local Setup Guide

Kokoro TTS Local Setup (2026)

Local AI Voice Clone: 5 Open Models Tested

Written by the Local AI Master Team

Grab the AI Starter Kit — career roadmap, cheat sheet, setup guide

Voice working locally? Build the whole pipeline.