Generate speech | EmpirioLabs AI Docs

Text-to-speech, music generation, and multi-speaker podcast TTS share this endpoint. It returns a hosted URL by default; pass response_format: "b64_json" to receive the audio bytes inline instead.

Authentication

Authorization: Bearer

Pass your EmpirioLabs API key as a bearer token. The Anthropic-style x-api-key header is also accepted on every endpoint.
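The two accepted header styles can be sketched as a small helper. This is a minimal illustration, not official client code: the function name `auth_headers` and its `style` argument are hypothetical, and only the header names come from the text above.

```python
def auth_headers(api_key: str, style: str = "bearer") -> dict:
    """Build request headers for either accepted authentication style.

    `auth_headers` and `style` are hypothetical names used for illustration.
    """
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    headers = {"Content-Type": "application/json"}
    if style == "bearer":
        # Standard bearer-token form.
        headers["Authorization"] = f"Bearer {api_key}"
    elif style == "x-api-key":
        # Anthropic-style header, also accepted on every endpoint.
        headers["x-api-key"] = api_key
    else:
        raise ValueError(f"unknown auth style: {style!r}")
    return headers
```

Send one style per request; how the server treats a request carrying both headers is not documented here.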

Request

This endpoint expects a JSON object.

model (string, required)

input (string, optional)

Script / lyrics. Use [S1] / [S2] tags for multi-speaker models.

prompt (string, optional)

Music generation models use prompt instead of input.

voice (string, optional)

output_format (string, optional)

duration (integer, optional)

Music generation only; output length in seconds.
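As a rough sketch of how the request fields above fit together, the helper below assembles a request body and leaves out any field that was not set. The function name `build_speech_payload`, the `text` argument (sent as `input`), and the model name `example-music-model` are placeholders for illustration. The client-side check on `duration` is an assumption about sensible values, not documented server behavior.

```python
import json
from typing import Optional


def build_speech_payload(
    model: str,
    text: Optional[str] = None,      # sent as "input" (script or lyrics)
    prompt: Optional[str] = None,    # used by music generation models
    voice: Optional[str] = None,
    output_format: Optional[str] = None,
    duration: Optional[int] = None,  # music generation only, in seconds
    response_format: Optional[str] = None,
) -> dict:
    """Return a request body containing only the fields that were provided."""
    if not model:
        raise ValueError("model is required")
    if duration is not None:
        # Assumed sanity check: a positive whole number of seconds.
        if isinstance(duration, bool) or not isinstance(duration, int) or duration <= 0:
            raise ValueError("duration must be a positive integer")
    fields = {
        "model": model,
        "input": text,
        "prompt": prompt,
        "voice": voice,
        "output_format": output_format,
        "duration": duration,
        "response_format": response_format,
    }
    return {key: value for key, value in fields.items() if value is not None}


# Hypothetical music request: prompt instead of input, plus a duration.
music_request = build_speech_payload(
    "example-music-model",
    prompt="Warm lo-fi piano with soft rain",
    duration=30,
    output_format="mp3",
)
print(json.dumps(music_request, indent=2))
```

Omitting unset fields keeps the body close to what the curl example sends, rather than posting explicit nulls whose handling is not specified.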

Response

Audio response (URL by default, or inline bytes).

data (list of objects, optional)

curl -X POST https://api.empiriolabs.ai/v1/audio/speech \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "soulx-podcast",
    "input": "[S1] Welcome to the show. [S2] Glad to be here. [S1] Let us dive in.",
    "output_format": "mp3",
    "voice_s1": "arthur",
    "voice_s2": "lj"
  }'
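The same call can be sketched in Python with the standard library. The helper names `tag_turns` and `build_podcast_request` are hypothetical, while the URL, model, and voice fields are copied from the curl example above. The sketch only builds the request object; sending it is left as a comment so the snippet runs without network access.

```python
import json
import urllib.request

API_URL = "https://api.empiriolabs.ai/v1/audio/speech"


def tag_turns(turns):
    """Join (speaker_number, line) pairs into an [S1]/[S2]-tagged script."""
    return " ".join(f"[S{speaker}] {line.strip()}" for speaker, line in turns)


def build_podcast_request(api_key, turns):
    """Build (but do not send) a POST request mirroring the curl example."""
    payload = {
        "model": "soulx-podcast",
        "input": tag_turns(turns),
        "output_format": "mp3",
        "voice_s1": "arthur",
        "voice_s2": "lj",
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_podcast_request(
    "<token>",
    [(1, "Welcome to the show."), (2, "Glad to be here."), (1, "Let's dive in.")],
)
# To send it, replace <token> with a real key and call:
#   with urllib.request.urlopen(request) as resp:
#       body = json.load(resp)
```

Building the script from (speaker, line) pairs keeps the tags consistent and avoids hand-editing a long tagged string.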

{
  "data": [
    {
      "url": "string",
      "duration": 1.1,
      "format": "mp3"
    }
  ]
}
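A response of this shape might be unpacked as follows. This is a sketch under stated assumptions: the function name `summarize_clips` is hypothetical, and the key `b64_json` for inline audio is a guess based on the `response_format` value, since the sample response only shows the URL form.

```python
import base64


def summarize_clips(body: dict) -> list:
    """Collect url, duration, format, and any inline audio from a response."""
    clips = []
    # "data" is optional, so treat a missing or null value as an empty list.
    for item in body.get("data") or []:
        clip = {
            "url": item.get("url"),
            "duration": item.get("duration"),
            "format": item.get("format"),
            "audio": None,
        }
        # ASSUMPTION: inline audio arrives base64-encoded under "b64_json".
        encoded = item.get("b64_json")
        if encoded:
            clip["audio"] = base64.b64decode(encoded)
        clips.append(clip)
    return clips


sample = {"data": [{"url": "https://example.com/clip.mp3", "duration": 1.1, "format": "mp3"}]}
for clip in summarize_clips(sample):
    print(clip["format"], clip["duration"], clip["url"])
```

In the default mode the caller would download `url` separately; with inline output, `audio` would hold the decoded bytes ready to write to a file.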