MiniMax M2.7 | EmpirioLabs AI Docs

MiniMax · Text Generation

POST /v1/chat/completions

MiniMax M2.7 is a general-purpose reasoning chat model with interleaved thinking, function calling, and prompt caching.

At a glance

Field	Value
Model id	`minimax-m2-7`
Model release date	2026-03-18
Input modalities	Text
Output modalities	Text
Context window	200K
Weight precision	-
Max output tokens	32,768
Region	Singapore
Features	reasoning, function_calling, cache
Native inference	No
New	Yes
Structured output	JSON Mode
Supported endpoints	`POST /v1/chat/completions`, `POST /v1/responses`, `POST /v1/messages`, `POST /v1beta/models/minimax-m2-7:generateContent`

Pricing

Charge	Spec	Rate
Input	per 1M prompt tokens	$0.15 (was $0.30)
Output	per 1M generated tokens	$0.60 (was $1.20)
Implicit cache read	per 1M cached input tokens	$0.03 (was $0.06)
Web search	per request when enabled	$0.013
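
As a rough illustration of how these rates combine, the sketch below estimates the cost of a single request. The helper `estimate_cost` is hypothetical, and it assumes that cached tokens are a subset of the prompt tokens and are billed at the cache-read rate instead of the input rate; check your actual usage records before relying on it. Thinking tokens are billed as output, so count them in `output_tokens`.

```python
from decimal import Decimal

# Current per-1M-token rates from the pricing table (USD).
INPUT_RATE = Decimal("0.15")
OUTPUT_RATE = Decimal("0.60")
CACHE_READ_RATE = Decimal("0.03")
WEB_SEARCH_FEE = Decimal("0.013")  # flat fee per request when web search is used

MILLION = Decimal(1_000_000)


def estimate_cost(prompt_tokens, output_tokens, cached_tokens=0, web_search=False):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is part of prompt_tokens and is billed at the
    cache-read rate rather than the normal input rate.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached_tokens = prompt_tokens - cached_tokens
    cost = (
        uncached_tokens * INPUT_RATE
        + cached_tokens * CACHE_READ_RATE
        + output_tokens * OUTPUT_RATE
    ) / MILLION
    if web_search:
        cost += WEB_SEARCH_FEE
    return cost


# 10,000 prompt tokens (4,000 of them cached), 2,000 output tokens, web search on.
print(estimate_cost(10_000, 2_000, cached_tokens=4_000, web_search=True))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without rounding drift.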

Example request

$ curl https://api.empiriolabs.ai/v1/chat/completions \
>   -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "minimax-m2-7", "messages": [{"role":"user","content":"Hello"}]}'
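
The same request can be sketched in Python using only the standard library. The function names `build_chat_request` and `send_chat_request` are illustrative, not part of any official SDK, and the response is returned as parsed JSON without assuming its exact shape.

```python
import json
import urllib.request

API_URL = "https://api.empiriolabs.ai/v1/chat/completions"


def build_chat_request(prompt, api_key):
    """Build (but do not send) the same POST request as the curl example."""
    payload = {
        "model": "minimax-m2-7",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send a prepared request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

To run it, read the key from the environment, for example `send_chat_request(build_chat_request("Hello", os.environ["EMPIRIOLABS_API_KEY"]))` after `import os`.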

Parameters

Parameter	Type	Required	Default	Description
`temperature`	number	no	`1`	Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
`top_p`	number	no	`0.95`	Nucleus sampling probability mass. Lower = more focused. · Range: 0 – 1
`max_tokens`	number	no	`4096`	Maximum tokens in the response. · Range: 1 – 131072
`stop`	string	no	-	Up to 4 strings at which the model stops generating further tokens.
`tools`	array	no	-	OpenAI-style function-calling tool definitions. Each entry has name, description, parameters.
`tool_choice`	string	no	-	auto | none | required | {"type": "function", "function": {"name": "…"}}. Controls whether the model may, must not, or must call a tool, or forces a specific function.
`response_format`	enum	no	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
`web_search_linkup`	boolean	no	false	Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. When invoked, adds $0.013 per call on top of the model’s normal token cost.
`disable_formatting`	boolean	no	false	When enabled, the gateway will not append the “Sources” footer to assistant responses that used Linkup web search. Useful when the model output is piped to another system that expects no decoration.
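
The parameters above can be combined into one request body. The following is a minimal sketch, not an official client: `build_payload` and the `get_weather` tool are hypothetical, and the tool entry assumes the usual OpenAI-style wrapper (`type: function` around a `function` object), which matches the object form shown for `tool_choice`.

```python
import json

# Hypothetical tool definition, used only for illustration.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_payload(messages, *, temperature=1, top_p=0.95, max_tokens=4096,
                  tools=None, tool_choice=None, web_search=False):
    """Assemble a request body, checking the ranges listed in the table."""
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must be in the range 0 to 2")
    if not 0 <= top_p <= 1:
        raise ValueError("top_p must be in the range 0 to 1")
    # The parameter table gives 1 to 131072; the at-a-glance table lists
    # 32,768 max output tokens, so larger values may be capped server-side.
    if not 1 <= max_tokens <= 131072:
        raise ValueError("max_tokens must be in the range 1 to 131072")

    payload = {
        "model": "minimax-m2-7",
        "messages": messages,
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
    }
    if tools:
        payload["tools"] = tools
    if tool_choice is not None:
        payload["tool_choice"] = tool_choice
    if web_search:
        payload["web_search_linkup"] = True  # adds the per-call search fee
    return payload


payload = build_payload(
    [{"role": "user", "content": "What is the weather in Singapore?"}],
    temperature=0.2,
    tools=[WEATHER_TOOL],
    tool_choice="auto",
)
print(json.dumps(payload, indent=2))
```

Validating ranges on the client side is optional; it simply surfaces an out-of-range value before the request is sent.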

Notes

Supports interleaved thinking, function calling, and implicit prompt cache reads. Thinking is always on and billed as output tokens.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/minimax-m2-7.