Kokoro - an open-weight TTS model with 82 million parameters.

By admin , 21 五月, 2025

https://github.com/hexgrad/kokoro

Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.

支持普通话,合成8个字2.5秒的短句耗时0.7秒。这个模型似乎真的是很快!可以普通话音调不太对,也不提供开源的训练代码。

来自AI的比较:

模型语言支持计算资源需求风格控制开源状态
Kokoro多语言低(CPU 可用)灵活开源
Tacotron 2单语言为主高(依赖 GPU)有限开源
VITS多语言中高较强开源
商业模型(如 Google WaveNet)多语言极高(云端服务)丰富闭源

标签

评论

Restricted HTML

  • 允许的HTML标签:<a href hreflang> <em> <strong> <cite> <blockquote cite> <code> <ul type> <ol start type> <li> <dl> <dt> <dd> <h2 id> <h3 id> <h4 id> <h5 id> <h6 id> <img src>
  • 自动断行和分段。
  • 网页和电子邮件地址自动转换为链接。
CAPTCHA
请输入"Drupal"
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.

最新内容

最新评论