Seedance 2.0 Pro

ByteDance · Video Generation

POST /v1/videos/generations

Multimodal video model for cinematic output from text, image, audio, or video inputs, with stable motion and consistent characters.

At a glance

Field	Value
Model id	`seedance-2-0-pro`
Model release date	2026-02-12
Input modalities	Text, Image, Video, Audio
Output modalities	Video
Context window	-
Weight precision	-
Region	Malaysia
Features	audio_sync, camera_control, character_consistency
Native inference	No
New	No
Supported endpoints	`POST /v1/videos/generations`
Alternate model ids	`bytedance/seedance-2.0-pro`, `seedance-2.0-pro`

Pricing

Charge	Spec	Rate
T2V/I2V 480P	per second	$0.139
T2V/I2V 720P	per second	$0.300
T2V/I2V 1080P	per second	$0.749
T2V/I2V 4K	per second	$1.555
Video Input 480P	per second	$0.342
Video Input 720P	per second	$0.736
Video Input 1080P	per second	$1.841
Video Input 4K	per second	$3.732

Example request

$ curl https://api.empiriolabs.ai/v1/videos/generations \
>   -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "seedance-2-0-pro", "prompt": "sunrise over the ocean", "duration": 6}'

Parameters

Parameter	Type	Required	Default	Description
`prompt`	string	yes	-	Scene description.
`mode`	enum	no	`"auto"`	auto: detect from inputs. t2v: text-to-video. i2v_first: animate first frame. i2v_both: morph between start (image) and end (image_end). reference: use image as visual reference. edit: edit attached video. extend: extend attached video. · Allowed: `auto`, `t2v`, `i2v_first`, `i2v_both`, `reference`, `edit`, `extend`
`resolution`	enum	no	`"720p"`	Video generation resolution. 4K is available on Seedance 2.0 Pro. · Allowed: `480p`, `720p`, `1080p`, `4k`
`aspect_ratio`	enum	no	`"adaptive"`	adaptive: derive from input image. · Allowed: `adaptive`, `16:9`, `9:16`, `1:1`, `4:3`, `3:4`, `21:9`
`custom_duration`	boolean	no	true	If false, the model decides clip length. If true, use the duration field.
`duration`	number	no	`5`	Clip length in seconds. Only used when custom_duration=true. · Range: 4 – 15
`generate_audio`	boolean	no	true	Generate native audio with the video.
`image`	string	no	-	Reference image URL.
`image_end`	string	no	-	End-frame image URL for i2v_both.
`video`	string	no	-	Reference video URL for edit / extend.
`negative_prompt`	string	no	`""`	What to avoid.

Notes

Multimodal video from text, images, audio, and video inputs. Native audio-video sync, strong motion stability, consistent character handling. Outputs up to 4K (3840x2160).

Tip

Pair with Seedream 5.0 Lite for the reference image first when targeting lifelike-face cohesion across multiple inputs.

4K output

4K outputs are native 10-bit H.265 (HEVC) for maximum quality, delivered in full on download and through the API. Browsers cannot decode 4K HEVC inline, so the playground plays a 1080p preview while Download gives the full 4K. Open 4K files in any HEVC-capable player or editor.

Uploaded media preprocessing

Video inputs are capped to 15 seconds for reference, edit, and extend workflows.
Uploaded video inputs are normalized to provider-compatible MP4 when needed.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/seedance-2-0-pro.