compose renders a video from a JSON timeline. Start with one clip, then layer up: more clips, transitions, overlays, and effects. This page builds from the smallest render to the full model. Each step adds one idea.
Your simplest render
The smallest timeline is one track with one clip. This trims three seconds of a video and renders it to MP4. The whole API call:Assets: video, image, text, audio, composition · Output: mp4 / webm / gif / mp3 (audio only) / jpg / png (still frame)
Join two clips
Put a second clip on the same track and a transition between them. The clips overlap by the transition’sduration, so the second clip starts a little before the first ends.
slide also accepts direction: "left" | "right" | "up" | "down".
| Transition | Effect |
|---|---|
crossfade | A dissolves directly into B |
fade | A dips through black, then B fades up |
slide | B slides in over A from an edge (directional) |
zoom | B grows in from the center |
How the timeline works
The model is two ideas:- Tracks stack. The first track is the background. Each later track composites on top of the ones below it: overlays, titles, picture-in-picture.
- Clips sit in time. Each clip has a
startand alengthin seconds. Two clips on the same track are joined by a transition. A gap between them plays nothing.
Add a layer
A second track renders over the first. This lays an animated title over the video.Effects on a clip
Effects are fields on a clip. Stack as many as you like on one clip. Each example below shows the source on the left and the rendered result on the right. Exact fields and ranges are in the Reference.Color and filters
color grades a clip with contrast, saturation, temperature, brightness, gamma, and hue. filter applies a one-shot look like greyscale or boost.
Motion and speed
transform positions, scales, and rotates a clip. Add animateTo for a Ken Burns move, or scale a clip down and place it on an overlay track for picture-in-picture. speed is a playback multiplier: below 1 for slow motion, above 1 for a timelapse.
Compositing
chromaKey keys out a solid background color so you can drop a presenter onto any scene. blendMode controls how an overlay blends with the track below. opacity (a number or keyframes) and blur handle fades and soft backdrops.
Text and titles
Atext asset renders a styled title. Set the font, size, weight, color, position, and an animate entrance (fade, slideUp, slideDown, slideLeft, slideRight). Put one text clip on its own track for a title card, another low and left for a lower-third name tag.
Set the canvas
output controls the resolution, frame rate, and format. A 9:16 canvas with a crop on the clip turns landscape footage into a vertical Reel that fills the frame, no black bars.
Scene-first authoring
Tracks give you precise multi-track control. For a sequential edit, you can instead describe a list of scenes. Each scene is a self-contained segment with its own clips and an optional transition into the next.overlays is a flat list of clips that sit over the whole composition, like a persistent logo or watermark.
start and length to span the whole scene. Every clip field works the same as in tracks mode. scenes and tracks are mutually exclusive: a timeline uses one or the other.
Reference
Output
Theoutput object controls the render target.
| Field | Values |
|---|---|
format | mp4 (default), webm, gif (no audio), mp3 (audio only), jpg, png (still frame) |
resolution | { width, height } (integer pixels) |
fps | number |
videoCodec | h264, vp9 (optional; per-format default otherwise) |
audioCodec | aac, opus, mp3 (optional) |
frameTime | second of the timeline to capture for jpg / png (default 0) |
Assets
A clip’sasset is one of:
type | Fields |
|---|---|
video | src, trim { from, to }, volume (default 1) |
image | src |
audio | src, trim, volume, fadeIn, fadeOut (seconds) |
text | text, style |
composition | timeline, a nested { tracks } rendered and composited as a single clip |
Clip fields
Every clip hasasset, start, length, plus any of:
| Field | What it does |
|---|---|
opacity | 0..1, or keyframes [{ time, value, easing: step | linear | smooth }] |
transform | position {x,y}, scale, rotate, anchor (center | topLeft), fit (contain | fill), animateTo (Ken Burns end state) |
crop | top / right / bottom / left, each a percent string like "10%" |
speed | playback rate (0.5 = slow motion, 2 = fast, and pitch shifts with speed) |
color | brightness (-1..1), contrast (0..4), saturation (0..3), gamma (0.1..10), hue (-180..180), temperature (1000..40000 K) |
filter | greyscale, negative, boost, muted, lighten, darken, contrast (mutually exclusive with color) |
blur / sharpen | gaussian blur strength / unsharp amount |
chromaKey | color (hex), similarity (0..1) |
blendMode | normal, multiply, screen, overlay, darken, lighten, add, difference |
pan | stereo balance, -1 (left) to 1 (right) |
flip | { horizontal, vertical } |
Transitions
Place between two clips on a track:{ "type": "transition", "transition": <kind>, "duration": <seconds>, "direction"?: <dir> }.
crossfade
A dissolves directly into B.
fade
A dips through black, then B fades up.
slide
B slides in from an edge. Set
direction: left, right, up, or down.zoom
B grows in from the center.
none
Hard cut, no blend.
Text style
Fields on a text asset’sstyle:
Type
font, size, weight (100..1000), color, align (left, center, right)Position
position as { x, y } percent stringsDecoration
stroke {color,width}, background, shadow {color,opacity,offsetX,offsetY}Entrance
animate {type, duration}, where type is fade, slideUp, slideDown, slideLeft, or slideRight