MOSS-Video-and-Audio

MOSS-Video-and-Audio

Provider: OpenMOSS
Category: Video Generation
Endpoint: POST /v1/videos/generations
Context window:
Served from: EmpirioLabs (Native Inference)

Open-source 32B MoE foundation model that generates synchronized video and audio in one inference step with precise dual-tower lip-sync.

At a glance

FieldValue
Model idmoss-video-and-audio
Input modalitiestext
Output modalitiesvideo
Context window
RegionEmpirioLabs (Native Inference)
Featuresaudio_sync, lipsync
NewNo
Native inferenceYes

Pricing

ChargeSpecRate
360p Videoper second$0.17
720p Videoper second$2.79
T2V Fastadditional fee$0.065
T2V Qualityadditional fee$0.13

Example request

$curl https://api.empiriolabs.ai/v1/videos/generations \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "moss-video-and-audio", "prompt": "sunrise over the ocean", "duration": 6}'

Parameters

ParameterTypeRequiredDefaultDescription
promptstringyesScene description. With image attached, becomes an image-to-video prompt.
modeenumno"t2v"t2v: pure text-to-video. i2v: animate the attached image. · Allowed: t2v, i2v
resolutionenumno"720p"720p uses a separate higher-VRAM endpoint. · Allowed: 360p, 720p
aspect_ratioenumno"landscape"MOSS only supports landscape (16:9) and portrait (9:16). · Allowed: landscape, portrait
durationnumberno8Clip length in seconds. The upstream model is hard-capped at 8s. · Range: 2 – 8
t2v_qualityenumno"quality"Text-to-video only. fast trades fidelity for ~2× speed. · Allowed: fast, quality
num_inference_stepsnumberno25Diffusion steps. More = higher fidelity, slower. · Range: 10 – 50
cfg_scalenumberno5Classifier-free guidance. Higher = follows prompt more strictly. · Range: 1 – 10
sigma_shiftnumberno5Schedule shift. Only valid when resolution=360p. · Range: 1 – 10
imagestringnoReference image URL for i2v mode.
negative_promptstringno""What to avoid.
seednumbernoReproducibility seed.

Live machine-readable schema is also available at GET https://api.empiriolabs.ai/v1/models/moss-video-and-audio.