Qwen3 Max Preview

Qwen3 Max Preview
Alibaba Cloud · Text Generation
POST /v1/chat/completions

Preview release with major gains over the 2.5 series in Chinese-English understanding, complex instructions, multilingual ability, and tool use.

This model is deprecated and will be retired on 2026-09-08. After that date, requests to this model will fail. Migrate to a successor model before then.

At a glance

FieldValue
Model idqwen3-max-preview
Input modalitiesText
Output modalitiesText
Context window256K
Weight precision-
Max output tokens65,536
RegionSingapore
Featuresreasoning, code_interpreter, web_search
Native inferenceNo
NewNo
Supported endpointsPOST /v1/chat/completions, POST /v1/responses, POST /v1/messages
Deprecation date2026-09-08

Pricing

ChargeSpecRate
Inputper 1M prompt tokens<=32K $1.08 (was $1.20); 32K-128K $2.16 (was $2.40); 128K-256K $2.70 (was $3.00)
Outputper 1M generated tokens<=32K $4.80 (was $6.00); 32K-128K $9.60 (was $12.00); 128K-256K $12.00 (was $15.00)

Example request

$curl https://api.empiriolabs.ai/v1/chat/completions \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "qwen3-max-preview", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

ParameterTypeRequiredDefaultDescription
temperaturenumberno0.7Sampling temperature · Range: 0 – 2
top_pnumberno1.0Nucleus sampling · Range: 0 – 1
max_tokensnumberno4096Max output tokens · Range: 1 – 65536
frequency_penaltynumberno0Penalty for repeated tokens. >0 reduces repetition, <0 encourages it. · Range: -2 – 2
presence_penaltynumberno0Penalty for new vs. seen tokens. >0 encourages new topics, <0 encourages staying on topic. · Range: -2 – 2
stopstringno-Comma-separated stop sequences
enable_thinkingbooleannotrueModel thinks step-by-step before responding.
tool_code_interpreterbooleannofalseAllow the model to write and execute Python code.
disable_formattingbooleannofalseSkip the EmpirioLabs Markdown formatting (citation [N] rewriting + References block when web search / tools were used). The raw upstream answer with plain [N] citations is returned.

Notes

Deep thinking + code interpreter both available as opt-in toggles.


Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/qwen3-max-preview.