DeepYard

Qwen3 TTS


Open-source TTS model with 12Hz audio generation and voice cloning capabilities

Open Source

About

Qwen3 TTS is a 1.7B-parameter text-to-speech model that offers high-quality 12Hz audio synthesis and custom voice cloning. With 831K downloads on HuggingFace, it lets developers generate natural-sounding speech while preserving speaker characteristics. It suits applications that need personalized voice generation without depending on an external API.

Details

Modality: text
Release Date: Feb 20, 2026
API Available: Yes
Hosting: api

Tags

open-weight, audio, self-hosted, fine-tunable, api