Kimi K2.6

Kimi K2.6
Moonshot AI · Text Generation
POST /v1/chat/completions

Kimi K2.6 is a Moonshot multimodal reasoning model with 256K context, strong coding, and text, image, and video inputs.

At a glance

FieldValue
Model idkimi-k2-6
Input modalitiesText, Image, Video
Output modalitiesText
Context window256K
Weight precision-
Max output tokens16,000
RegionChina
Featuresreasoning, function_calling, cache, multimodal
Native inferenceNo
NewYes
Supported endpointsPOST /v1/chat/completions, POST /v1/responses, POST /v1/messages

Pricing

ChargeSpecRate
Inputper 1M prompt tokens$0.8939 (was $0.95)
Outputper 1M generated tokens$3.7131 (was $4.00)
Implicit cache readper 1M cached input tokens$0.1788

Example request

$curl https://api.empiriolabs.ai/v1/chat/completions \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "kimi-k2-6", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

ParameterTypeRequiredDefaultDescription
temperaturenumberno0.7Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
top_pnumberno0.9Nucleus sampling probability mass. Lower values make outputs more focused. · Range: 0 – 1
max_tokensnumberno4096Maximum output tokens. · Range: 1 – 16000
stopstringnoUp to 4 strings where the model will stop generating further tokens.
enable_thinkingbooleannotrueEnable reasoning before answering.
thinking_budgetnumberno32768Maximum tokens reserved for reasoning when thinking is enabled. · Range: 1 – 81920
reasoning_effortenumno"medium"Reasoning effort level. none disables thinking. low, medium, high, and max set bounded thinking budgets sized to the selected model. Sent as an OpenAI-style reasoning_effort field, translated into enable_thinking and thinking_budget for the model service. · Allowed: none, low, medium, high, max

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/kimi-k2-6.