Inworld

Inworld

Inworld

Models from Inworld

TTS 1.5 Max
TTS 1.5 Max
Broadcast-quality voice synthesis with rich expressive prosody, 271+ voices across 15 languages, and real-time SSE streaming with per-word timestamps.
TTS 1.5 Mini
TTS 1.5 Mini
Sub-130ms TTFB voice synthesis with 271+ voices across 15 languages, expressive prosody, and real-time SSE streaming for low-latency voice agents.