Transcribe audio and generate speech on European GPUs.
Open-source models with custom voice cloning.
Your voice data never leaves the EU.
We run the Qwen3 ASR and TTS model families for speech recognition and synthesis. Multilingual, open weights, and optimized for production workloads. Custom voice cloning included.
All models run on modern Blackwell or newer chips for ideal performance. Free tier included on all models.
Speech APIs enable a wide range of applications. From transcription pipelines to voice-enabled products.
The Speech API follows the OpenAI Audio API format. Use the same endpoints and SDKs you already know.
Supports: /v1/audio/transcriptions, /v1/audio/speech. Custom voice via the voice parameter. Multiple audio formats.
5 minutes of transcription and synthesis per month. No credit card required.
Create free account