Nova Lite 2 | EmpirioLabs AI Docs

Amazon · Text Generation

POST /v1/chat/completions

Fast, cost-effective multimodal reasoning model for text, images, documents, and video on a 1M context (long docs and ~90 min clips).

At a glance

Field	Value
Model id	`nova-lite-2`
Model release date	2025-12-02
Input modalities	Text, Image, Video, Document
Output modalities	Text
Context window	1M
Weight precision	-
Max output tokens	32,000
Features	vision, function_calling, reasoning
Native inference	No
New	No
Structured output	JSON Mode
Batch API	35% off list price
Supported endpoints	`POST /v1/chat/completions`, `POST /v1/responses`, `POST /v1/messages`, `POST /v1beta/models/nova-lite-2:generateContent`
Alternate model ids	`amazon-nova-lite-2`, `amazon/nova-lite-2`, `us.amazon.nova-2-lite-v1:0`

Pricing

Charge	Spec	Rate
Input	per 1M prompt tokens	$0.38
Output	per 1M generated tokens	$3.16
Cached input	per 1M tokens	$0.2128
Web Search (Linkup)	per call when invoked	$0.013

Example request

$ curl https://api.empiriolabs.ai/v1/chat/completions \
>   -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "nova-lite-2", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

Parameter	Type	Required	Default	Description
`temperature`	number	no	`0.7`	Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
`top_p`	number	no	`0.9`	Nucleus sampling probability mass. Lower = more focused. · Range: 0 – 1
`max_tokens`	number	no	`4096`	Maximum tokens in the response. · Range: 1 – 65536
`stop`	string	no	-	Up to 4 strings where the model will stop generating further tokens.
`enable_reasoning`	boolean	no	true	Enable the model’s reasoning mode. Slower but improves multi-step problems.
`enable_thinking`	boolean	no	true	Enable extended reasoning before the final answer. Alias of enable_reasoning.
`reasoning_effort`	enum	no	`"medium"`	Reasoning effort level (low \| medium \| high). Higher = more thinking time. · Allowed: `low`, `medium`, `high`
`reasoning`	string	no	-	Responses API reasoning object: {“effort”:“low\|medium\|high”}
`response_format`	enum	no	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
`web_search_linkup`	boolean	no	false	Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. Adds $0.013 per call when invoked on top of the model’s normal token cost. Disabled by default.
`disable_formatting`	boolean	no	false	When enabled, the gateway will not append the “Sources” footer to assistant responses that used Linkup web search. Useful when the model output is piped to another system that expects no decoration.

Notes

Reasoning traces are NOT exposed from AWS. Video uploads up to ~1 GB.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/nova-lite-2.