Nova Pro 1.0 | EmpirioLabs AI Docs

Amazon · Text Generation

POST /v1/chat/completions

Multimodal foundation model that balances accuracy, speed, and cost across text, image, and video inputs, with a 300K-token context window (up to ~30 minutes of video).

At a glance

Field	Value
Model id	`nova-pro-1-0`
Model release date	2024-12-03
Input modalities	Text, Image, Video
Output modalities	Text
Context window	300K
Weight precision	-
Max output tokens	5,000
Features	vision, function_calling
Native inference	No
New	No
Structured output	JSON Mode
Batch API	35% off list price
Supported endpoints	`POST /v1/chat/completions`, `POST /v1/responses`, `POST /v1/messages`, `POST /v1beta/models/nova-pro-1-0:generateContent`
Alternate model ids	`amazon-nova-pro`, `amazon.nova-pro-v1:0`, `amazon/nova-pro`, `nova-pro`, `us.amazon.nova-pro-v1:0`

Pricing

Charge	Spec	Rate
Input	per 1M prompt tokens	$2.40
Output	per 1M generated tokens	$9.60
Latency Optimized Input	per 1M prompt tokens	$3.00
Latency Optimized Output	per 1M generated tokens	$12.00
Web Search (Linkup)	per call when invoked	$0.013
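
To make the rates concrete, the sketch below estimates the cost of a single request from the figures in the table above. The helper name `estimate_cost_usd`, its arguments, and the tier labels are illustrative and not part of the API. It assumes token charges scale linearly and that the web search fee is simply added per call, and it leaves the Batch API discount out.

```python
from decimal import Decimal

# Rates copied from the pricing table (USD per 1M tokens).
RATES = {
    "standard": {"input": Decimal("2.40"), "output": Decimal("9.60")},
    "latency_optimized": {"input": Decimal("3.00"), "output": Decimal("12.00")},
}
WEB_SEARCH_PER_CALL = Decimal("0.013")  # Linkup, charged only when invoked
ONE_MILLION = Decimal(1_000_000)


def estimate_cost_usd(prompt_tokens, output_tokens, tier="standard", web_search_calls=0):
    """Hypothetical helper: estimate the USD cost of one request."""
    rate = RATES[tier]
    token_cost = (
        rate["input"] * prompt_tokens + rate["output"] * output_tokens
    ) / ONE_MILLION
    return token_cost + WEB_SEARCH_PER_CALL * web_search_calls


# 10,000 prompt tokens and 1,000 generated tokens at the standard rate:
# 10,000 * 2.40 / 1M + 1,000 * 9.60 / 1M = 0.024 + 0.0096 = 0.0336 USD
print(estimate_cost_usd(10_000, 1_000))
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many calls.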

Example request

$ curl https://api.empiriolabs.ai/v1/chat/completions \
>   -H "Authorization: Bearer $EMPIRIOLABS_API_KEY" \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "nova-pro-1-0", "messages": [{"role":"user","content":"Hello"}]}'
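
The same request can be sketched in Python using only the standard library. The function names `build_chat_request` and `send` are illustrative. The response is returned as parsed JSON without assuming any particular shape, since this page does not document the response body.

```python
import json
import os
import urllib.request

API_URL = "https://api.empiriolabs.ai/v1/chat/completions"


def build_chat_request(prompt, api_key):
    """Build (but do not send) the same request as the curl example."""
    body = {
        "model": "nova-pro-1-0",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send a prepared request and return the decoded JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Usage (performs a network call, so it is left commented out):
# reply = send(build_chat_request("Hello", os.environ["EMPIRIOLABS_API_KEY"]))
```

Building the request separately from sending it keeps the payload easy to inspect or log before any tokens are billed.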

Parameters

Parameter	Type	Required	Default	Description
`temperature`	number	no	`0.7`	Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
`top_p`	number	no	`0.9`	Nucleus sampling probability mass. Lower = more focused. · Range: 0 – 1
`max_tokens`	number	no	`4096`	Maximum tokens in the response. · Range: 1 – 65536
`stop`	string	no	-	Up to 4 strings where the model will stop generating further tokens.
`latency_optimized`	boolean	no	false	Prefer faster routing at a slight quality cost.
`enable_latency_optimized`	boolean	no	false	Enable latency-optimized routing for faster responses, billed at the catalog Latency Optimized rate ($3.00 input / $12.00 output per 1M tokens) instead of the Standard rate.
`response_format`	enum	no	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
`web_search_linkup`	boolean	no	false	Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. Adds $0.013 per call when invoked on top of the model’s normal token cost. Disabled by default.
`disable_formatting`	boolean	no	false	When enabled, the gateway will not append the “Sources” footer to assistant responses that used Linkup web search. Useful when the model output is piped to another system that expects no decoration.
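
As a rough illustration of how these parameters might be combined, the sketch below builds a request body and checks the documented ranges client-side before anything is sent. The helper `build_payload` is hypothetical. It assumes the gateway-specific flags (`web_search_linkup`, `enable_latency_optimized`) are top-level fields of the JSON body alongside `model` and `messages`, and that multiple `stop` values are sent as a list. Neither is confirmed by this page.

```python
def build_payload(prompt, *, temperature=0.7, top_p=0.9, max_tokens=4096,
                  stop=None, web_search_linkup=False,
                  enable_latency_optimized=False):
    """Hypothetical helper: validate parameters and build the request body."""
    # Ranges and defaults are taken from the parameter table.
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must be in the range 0 to 2")
    if not 0 <= top_p <= 1:
        raise ValueError("top_p must be in the range 0 to 1")
    if not 1 <= max_tokens <= 65536:
        raise ValueError("max_tokens must be in the range 1 to 65536")
    stop = list(stop or [])
    if len(stop) > 4:
        raise ValueError("at most 4 stop strings are allowed")

    payload = {
        "model": "nova-pro-1-0",
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
    }
    # Optional fields are only included when set (assumed to be top-level).
    if stop:
        payload["stop"] = stop
    if web_search_linkup:
        payload["web_search_linkup"] = True  # adds $0.013 per call when invoked
    if enable_latency_optimized:
        payload["enable_latency_optimized"] = True  # billed at the higher rate
    return payload


payload = build_payload("Summarize today's AI news.", web_search_linkup=True)
```

Validating locally is optional, but it surfaces out-of-range values before a request is billed.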

Notes

Latency-optimized inference is available as an option: responses are faster but billed at the higher Latency Optimized rate.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/nova-pro-1-0.
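
A minimal sketch of fetching that schema from Python follows. The function names are illustrative. It assumes the endpoint returns JSON, and because this page does not say whether authentication is required, the API key is optional here.

```python
import json
import urllib.request


def model_schema_url(model_id):
    """Return the schema URL for a model id, following the pattern above."""
    return f"https://api.empiriolabs.ai/v1/models/{model_id}"


def fetch_model_schema(model_id, api_key=None):
    """Fetch and decode the schema (assumes a JSON response body)."""
    request = urllib.request.Request(model_schema_url(model_id), method="GET")
    if api_key:  # whether a key is needed here is an assumption
        request.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Usage (performs a network call, so it is left commented out):
# schema = fetch_model_schema("nova-pro-1-0")
```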