Job Types

Each job in the queue has a type that determines which MetaHuman Animator pipeline it runs, what assets it requires, and which per-job options are available.

Mono

Mono Video

Processes a single color camera recording. This is the standard path for footage captured with a head-mounted camera or any single-lens setup. It is the fastest job type and the right default for most productions.

Required assets

Asset          Type                  Notes
Capture Data   UFootageCaptureData   Your video footage import asset
Identity       UMetaHumanIdentity    Auto-discovered if under a standard folder structure

Pipeline stages

1. Validate Input
2. Process Performance (Tracking + Fitting)
3. Export Outputs
4. Save Assets

Figure: A Mono job row in the queue, with Identity and Capture Data assigned and status "Ready".

Mono-specific options

Head Stabilization: Corrects for head movement drift during the solve. Recommended when the subject moves significantly.
Tongue Solve: Enables tongue tracking. Requires UE 5.7+ and appropriate capture quality. Disabled by default.

Stereo

Stereo HMC

Processes footage from a stereo (two-camera) head-mounted capture rig. Stereo jobs provide depth information that improves solve accuracy, particularly for facial geometry that is ambiguous in monocular footage. Depth sequences can be pre-generated or produced automatically by the plugin.

Required assets

Asset                Type                     Notes
Capture Data         UFootageCaptureData      Must reference both camera image sequences
Identity             UMetaHumanIdentity       Auto-discovered if under a standard folder structure
Camera Calibration   UCameraCalibrationData   Required for depth generation; auto-discovered if available

Pipeline stages

1. Validate Input
2. Generate Depth (Stereo only)
3. Clean Depth (optional)
4. Process Performance
5. Export + Save

Figure: A Stereo job during depth generation, the stage before the performance solve. The progress bar is visible and the "Generate Depth" stage label is highlighted.

Depth generation is serial. Even when Max Concurrent Jobs is set above 1, depth generation for all stereo jobs completes before any job enters the concurrent processing pool.
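
The ordering described in this note can be pictured with a small scheduling sketch. This is not the plugin's implementation: the `Job` class, `generate_depth`, `process_performance`, and `run_queue` are hypothetical names used only to illustrate that depth generation runs one job at a time before any job reaches the concurrent pool.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    job_type: str  # "Mono", "Stereo", or "Audio"


def generate_depth(job, events):
    # Placeholder for the real depth-generation stage (hypothetical).
    events.append(("depth", job.name))


def process_performance(job, events):
    # Placeholder for the real performance solve (hypothetical).
    events.append(("solve", job.name))


def run_queue(jobs, max_concurrent_jobs=1):
    events = []
    # Phase 1: depth generation, one stereo job at a time, in queue order.
    for job in jobs:
        if job.job_type == "Stereo":
            generate_depth(job, events)
    # Phase 2: jobs enter the concurrent pool only after phase 1 has finished.
    with ThreadPoolExecutor(max_workers=max_concurrent_jobs) as pool:
        list(pool.map(lambda j: process_performance(j, events), jobs))
    return events


events = run_queue(
    [Job("a", "Stereo"), Job("b", "Mono"), Job("c", "Stereo")],
    max_concurrent_jobs=2,
)
```

In this sketch every `("depth", ...)` event is recorded before the first `("solve", ...)` event, regardless of the `max_concurrent_jobs` value. The practical consequence is that raising Max Concurrent Jobs does not shorten the depth phase of a queue with many stereo jobs.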

Stereo-specific options

Solve Type: Selects the stereo solver variant. Leave at default unless directed by Epic support.
Head Movement Reference Frame: Frame number used as the reference pose for head movement correction.
Neutral Pose Calibration: Uses a neutral expression frame for calibrating the face geometry baseline.
Face Coverage Threshold: Minimum proportion of the face that must be visible in depth for a frame to be used.
Face Width Threshold: Minimum face width in depth pixels required to include a frame.
Baseline / Scale Tolerance: Acceptable deviation between the stereo baseline in calibration and the live footage.
Enable Depth Cleaning: Runs NNE hole-filling on the depth sequence before the performance solve. Preserves GrayF EXR format.
Depth Clamp Min / Max: Clips depth values outside this range before cleaning. Useful for eliminating background noise.

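
As a rough illustration of how Face Coverage Threshold and Face Width Threshold act together, the sketch below keeps only the frames that meet both minimums. The `FrameDepthStats` record and `usable_frames` function are hypothetical, and treating the thresholds as inclusive (`>=`) is an assumption; the plugin's actual per-frame measurements and comparison rules are not documented here.

```python
from dataclasses import dataclass


@dataclass
class FrameDepthStats:
    frame: int
    face_coverage: float  # fraction of the face visible in depth, 0.0 to 1.0
    face_width_px: float  # face width measured in depth pixels


def usable_frames(stats, coverage_threshold, width_threshold_px):
    """Return the frame numbers that meet both minimums (inclusive, by assumption)."""
    return [
        s.frame
        for s in stats
        if s.face_coverage >= coverage_threshold
        and s.face_width_px >= width_threshold_px
    ]


stats = [
    FrameDepthStats(frame=0, face_coverage=0.9, face_width_px=200),
    FrameDepthStats(frame=1, face_coverage=0.4, face_width_px=200),  # too little coverage
    FrameDepthStats(frame=2, face_coverage=0.9, face_width_px=80),   # face too narrow
]
kept = usable_frames(stats, coverage_threshold=0.6, width_threshold_px=100)
# kept == [0]
```

A frame is dropped if it fails either test, so raising one threshold can only shrink the set of usable frames.
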
Audio

Audio Only

Drives facial animation from an audio track alone — no video footage required. Useful for generating lip-sync and expression animation from dialogue recordings when no performance capture footage is available.

Required assets

Asset          Type                  Notes
Capture Data   UFootageCaptureData   Must contain an audio track; video tracks are ignored

Figure: An Audio job row in the queue. Only Capture Data is assigned; the Identity and Calibration columns are inactive.

Audio jobs do not require a MetaHuman Identity or Camera Calibration asset. They are the simplest job type to configure.
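
The three Required assets tables can be summarized as a lookup from job type to asset list. The sketch below is a hypothetical pre-flight check, not part of the plugin: `REQUIRED_ASSETS` and `missing_assets` are illustrative names, and the check ignores auto-discovery, which may fill in Identity and Camera Calibration without an explicit assignment.

```python
# Asset names follow the "Asset" column of the Required assets tables.
REQUIRED_ASSETS = {
    "Mono": ["Capture Data", "Identity"],
    "Stereo": ["Capture Data", "Identity", "Camera Calibration"],
    "Audio": ["Capture Data"],
}


def missing_assets(job_type, assigned):
    """Return the required assets not present in `assigned`, in table order."""
    return [asset for asset in REQUIRED_ASSETS[job_type] if asset not in assigned]


audio_missing = missing_assets("Audio", {"Capture Data"})
# audio_missing == []
stereo_missing = missing_assets("Stereo", {"Capture Data"})
# stereo_missing == ["Identity", "Camera Calibration"]
```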

Audio-specific options

Solver Mode: Realtime (faster, lower quality) or Offline (slower, higher quality). Use Offline for final output.
Mood Preset: Sets the emotional tone used to color the generated animation (Neutral, Happy, Sad, Angry, etc.).
Blink Generation: Auto-generates blinks when no blink data is present in the audio solve result.
Audio Channel: Which channel from a multi-track audio asset to use for the solve (0-indexed).
Lookahead Window: Number of frames the solver looks ahead in the audio signal for anticipation cues.

Universal Options

These options appear on every job regardless of type.

Frame Exclusion Ranges: One or more frame ranges to skip during the solve. Use to cut out blinks, obstructions, or bad frames.
Show Frames: When enabled, the solver renders preview frames during processing. Useful for debugging; slower in production.
Capture Frame Rate: Override the frame rate detected from the capture data. Leave blank to use auto-detection.
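
To make Frame Exclusion Ranges and Capture Frame Rate concrete, here is a minimal sketch of how they could be resolved before a solve. Both functions are hypothetical. Treating each range as inclusive at both ends, and representing a blank Capture Frame Rate as `None`, are assumptions rather than documented plugin behavior.

```python
def frames_to_solve(first_frame, last_frame, exclusion_ranges):
    """Return the frames in [first_frame, last_frame] that no exclusion range covers.

    Each range is a (start, end) pair, treated as inclusive (an assumption).
    """
    excluded = set()
    for start, end in exclusion_ranges:
        excluded.update(range(start, end + 1))
    return [f for f in range(first_frame, last_frame + 1) if f not in excluded]


def effective_frame_rate(detected_fps, override_fps=None):
    """Use the override when one is set; otherwise fall back to the detected rate."""
    return detected_fps if override_fps is None else override_fps


remaining = frames_to_solve(0, 9, [(2, 4), (8, 8)])
# remaining == [0, 1, 5, 6, 7, 9]
```

Overlapping or out-of-order ranges are harmless in this sketch because the excluded frames are collected into a set first.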