Show HN: Voice Cloning and Multilingual TTS in One Click (Windows)

We've created an open-source alternative to Eleven Labs for voice cloning and multilingual TTS. Key features:

- Clone voices from 15-second samples - 50+ pre-trained celebrity voice models - Support for 100+ languages via Google Translator - Speech recognition with Whisper - One-click Windows installation - AI cover generation with pre-trained models

Demo videos showing podcast creation and multilingual dubbing: https://youtu.be/z8g8LMhoh_o (Podcast) https://youtu.be/ZtyhrZHbW0Y (Original) https://youtu.be/CA4WYdkJrkQ (English) https://youtu.be/hSEe0trPtnQ (Spanish) https://youtu.be/qwExW2sReNc (Chinese)

Try it: https://github.com/abus-aikorea/voice-pro


Comments URL: https://news.ycombinator.com/item?id=42836934

Points: 5

# Comments: 1

https://github.com/abus-aikorea/voice-pro/blob/main/docs/README.eng.md

созданный 1mo | 27 янв. 2025 г., 04:20:10


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Open Source LLMOps Stack

Some background: I work on Langfuse and we've been collaborating with LiteLLM.

(LiteLLM is a Python library and proxy/gateway that handles cost management, virtual keys, caching, and rate-limiti

28 февр. 2025 г., 22:10:10 | Hacker news