GLM 5.2

GLM 5.2
Z.ai · Text Generation
/v1/chat/completions

Reasoning and coding model with a 1M token context, 128K output, adjustable reasoning effort, native web search, and tool calling.

At a glance

FieldValue
Model idglm-5-2
Input modalitiesText
Output modalitiesText
Context window1M
Weight precision-
Max output tokens131,072
Featuresreasoning, function_calling, structured_output, web_search
Native inferenceNo
NewYes
Supported endpoints/v1/chat/completions, /v1/responses, /v1/messages

Pricing

ChargeSpecRate
Inputper 1M prompt tokens$1.40
Outputper 1M generated tokens$4.40
Web Searchper request$0.033

Example request

$curl https://api.empiriolabs.ai/v1/chat/completions \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "glm-5-2", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

ParameterTypeRequiredDefaultDescription
max_tokensintegerno65536Maximum number of output tokens to generate. · Range: 1 – 131072
temperaturenumberno1Controls randomness. Lower values make responses more deterministic. · Range: 0 – 1
top_pnumberno0.95Nucleus sampling cutoff. · Range: 0.01 – 1
reasoning_effortenumno"max"GLM-5.2 reasoning effort. none disables thinking; minimal through max set how hard the model reasons before answering. max is recommended for complex coding. · Allowed: none, minimal, low, medium, high, xhigh, max
enable_thinkingbooleannotrueAllow the model to reason before answering. Turn off for the lowest-latency replies or strict structured output.
do_samplebooleannotrueEnable sampling. Turn off for greedy deterministic output (temperature and top_p are ignored).
tool_web_searchbooleannofalseEnable built-in web search. Adds $0.033 per request when used.
search_recency_filterenumno"noLimit"Limit web search results to a recency window. · Allowed: oneDay, oneWeek, oneMonth, oneYear, noLimit
countintegerno10Number of web search results to retrieve when web search is enabled. · Range: 1 – 50
search_domain_filterstringno-Restrict web search to a specific domain.
search_promptstringno-Optional prompt used to summarize retrieved web search results.
search_resultbooleannotrueReturn web search result metadata in the response when web search is enabled.
tool_streambooleannofalseStream function-call arguments incrementally when streaming.
toolsarrayno[]OpenAI-compatible function calling tool definitions.
tool_choiceobjectno-OpenAI-compatible tool choice control.
response_formatobjectno-OpenAI-compatible JSON mode. Use thinking disabled for strict structured output.
stoparrayno-Optional stop sequences (up to 4).

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/glm-5-2.