Qwen3 Max Preview | EmpirioLabs AI Docs

POST /v1/chat/completions

Preview release with major gains over the 2.5 series in Chinese-English understanding, complex instructions, multilingual ability, and tool use.

This model is deprecated and will be retired on 2026-10-10. After that date, requests to this model will fail. Migrate to a successor model before then.

At a glance

Field	Value
Model id	`qwen3-max-preview`
Model release date	2025-09-05
Input modalities	Text
Output modalities	Text
Context window	256K
Weight precision	-
Max output tokens	65,536
Region	Singapore
Features	reasoning, code_interpreter, web_search
Native inference	No
New	No
Structured output	JSON Mode
Supported endpoints	`POST /v1/chat/completions`, `POST /v1/responses`, `POST /v1/messages`, `POST /v1beta/models/qwen3-max-preview:generateContent`
Deprecation date	2026-10-10

Pricing

Charge	Spec	Rate
Input	per 1M prompt tokens	<=32K $1.08 (was $1.20); 32K-128K $2.16 (was $2.40); 128K-256K $2.70 (was $3.00)
Output	per 1M generated tokens	<=32K $4.80 (was $6.00); 32K-128K $9.60 (was $12.00); 128K-256K $12.00 (was $15.00)

Example request

$ curl https://api.empiriolabs.ai/v1/chat/completions \
>   -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "qwen3-max-preview", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

Parameter	Type	Required	Default	Description
`temperature`	number	no	`0.7`	Sampling temperature · Range: 0 – 2
`top_p`	number	no	`1.0`	Nucleus sampling · Range: 0 – 1
`max_tokens`	number	no	`4096`	Max output tokens · Range: 1 – 65536
`frequency_penalty`	number	no	`0`	Penalty for repeated tokens. >0 reduces repetition, <0 encourages it. · Range: -2 – 2
`presence_penalty`	number	no	`0`	Penalty for new vs. seen tokens. >0 encourages new topics, <0 encourages staying on topic. · Range: -2 – 2
`stop`	string	no	-	Comma-separated stop sequences
`enable_thinking`	boolean	no	true	Model thinks step-by-step before responding.
`tool_code_interpreter`	boolean	no	false	Allow the model to write and execute Python code.
`response_format`	enum	no	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
`disable_formatting`	boolean	no	false	Skip the EmpirioLabs Markdown formatting (citation [N] rewriting + References block when web search / tools were used). The raw upstream answer with plain [N] citations is returned.

Notes

Deep thinking + code interpreter both available as opt-in toggles.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/qwen3-max-preview.