Wan2.7 Image

Wan2.7 Image
Alibaba Cloud · Image Generation
POST /v1/images/generations

Image generation and editing companion model: text-to-image, bounding-box edits, and cohesive image sets, with up to 4K output on Pro.

At a glance

FieldValue
Model idwan2-7-image
Input modalitiesText, Image
Output modalitiesImage
Context window-
Weight precision-
RegionSingapore
Featuresimage_editing, 4k
Native inferenceNo
NewNo
Supported endpointsPOST /v1/images/generations

Pricing

ChargeSpecRate
Standardper image$0.030
Proper image$0.075

Example request

$curl https://api.empiriolabs.ai/v1/images/generations \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "wan2-7-image", "prompt": "a misty forest at dawn", "size": "1024x1024"}'

Parameters

ParameterTypeRequiredDefaultDescription
promptstringyes-Text prompt. For Image Set mode, describe each image in sequence (e.g. ‘First image: … Second image: …’).
model_variantenumno"wan2.7-image-pro"Pro: 4K + Thinking Mode + higher quality. Standard: faster, cheaper. · Allowed: wan2.7-image-pro, wan2.7-image
aspect_ratioenumno-Optional. If omitted, text-to-image defaults to 16:9; image-edit matches the last input image’s native ratio. · Allowed: 16:9, 1:1, 9:16, 3:2, 2:3, 4:3, 3:4, 5:4, 4:5
resolutionenumno"2K"Quality tier. 1K ~1MP, 2K ~4MP, 4K ~16MP (Pro text-to-image only). · Allowed: 1K, 2K, 4K
use_custom_sizebooleannofalseOverride aspect_ratio + resolution with explicit pixel dimensions.
custom_widthnumberno-Pixels (768–2048; Pro text-to-image up to 4096). Only when use_custom_size=true. Aspect must be in [1:8, 8:1]. · Range: 768 – 4096
custom_heightnumberno-Pixels (768–2048; Pro text-to-image up to 4096). Only when use_custom_size=true. · Range: 768 – 4096
enable_sequentialbooleannofalseImage Set Mode. Generate up to 12 cohesive related images. Disables thinking_mode and color_palette.
num_imagesnumberno1Standard mode only. Number of images to generate. Each is billed separately. · Range: 1 – 4
num_images_setnumberno1Image Set Mode only. Maximum images in the set. The model decides the actual count; you’re billed only for images generated. · Range: 1 – 12
thinking_modebooleannotruePro only. Enhanced inference for higher quality at the cost of latency. Auto-ignored when input images are attached or sequential mode is on.
imagearrayno-Up to 9 input image URLs for editing. Triggers image-edit mode automatically.
color_palettestringno-Optional JSON array of 3–10 {hex, ratio} objects (ratios sum to 100%). Standard mode only. Example: [{“hex”:“#C2D1E6”,“ratio”:“60.00%”},{“hex”:“#636574”,“ratio”:“40.00%”}]
bbox_liststringno-Interactive editing boxes as JSON. Use [] for no box, or [[x1,y1,x2,y2]] for one box.
seednumberno-Reproducibility seed. Leave empty for random. · Range: 0 – 2147483647
watermarkbooleannofalseForwarded to DashScope as parameters.watermark. When true, the upstream Wan service stamps a small Alibaba/Wan provenance watermark on each generated image. EmpirioLabs does not add any watermark of its own.
response_formatenumno"url"url (default) returns a signed URL; b64_json returns inline base64. · Allowed: url, b64_json

Notes

Upload 1-9 images for editing mode; without images, operates as text-to-image.

Constraints

  • Prompt max 5,000 characters
  • Generation: 10-60s depending on resolution, image count, model variant, and thinking mode

Image inputs (editing mode)

  • JPEG, PNG, BMP, or WEBP
  • 240-8000 px per side
  • Aspect ratio 1:8 to 8:1
  • Max 20 MB per image

Capabilities

  • Style transfer, object placement, scene blending, background replacement, multi-reference composition, interactive editing with bounding boxes
  • Pro variant supports up to 4K output

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/wan2-7-image.