How Prompt to Video works
You provide a script or voiceover audio. Captions generates a complete talking-head video using either your AI Twin (a digital avatar that looks and sounds like you) or one of Captionsâ built-in actors. A realistic person delivers your content, synced to your words. The output is a finished video with captions, B-roll, music, and motion graphics applied.Set up your AI Twin (optional but recommended)
If you want the avatar to look like you, set up your AI Twin before starting. Record a 1-minute calibration video and Captions generates your digital avatar. See the AI Twin setup guide for exact instructions.You can skip this and use one of Captionsâ built-in actors instead. Your AI Twin just makes the output look like you personally.
Create a no-camera video with Prompt to Video
Write your script or record a voiceover
Type your script directly into the script field, or record an audio voiceover to use as the driver. You can also import an existing audio file. The script or audio becomes the spoken content of the video.
Choose your actor
Select My AI Twin if youâve set one up, or choose from Captionsâ library of built-in actors. Preview each actor to find the right fit for your contentâs tone.
Choose an AI Edit style
Pick a style that controls the look of the final video: caption design, B-roll, music, transitions, and motion graphics. Browse the 95+ style options and select one that matches your content. You can use the same style across all your Mirage videos for visual consistency.
Generate your video
Tap Generate. Captions produces the complete talking-head video synced to your script or audio. Generation typically takes a couple of minutes depending on video length.
Review and adjust
Watch the output from start to finish. Use Co-editor to make quick changes by typing: âfix the caption on the word X,â âswap the B-roll at 0:15,â or âchange the music.â You can also edit manually in the timeline.
Tips
- Record your voiceover in a quiet space even though youâre not on camera. Clean source audio produces a better-synced, more natural result
- 60-90 seconds is the sweet spot for Prompt to Video; longer scripts can be split into separate videos
- Use the same actor and style across your videos to build a consistent visual identity
- If youâre not happy with the first output, try re-generating. Slight variations in timing and expression are normal
Whatâs next?
AI Twin Setup
Create a digital avatar that looks and sounds like you.
Prompt to Video docs
Full reference for the Prompt to Video feature.
AI Edit Workflow
The full AI Edit workflow for creators.
Co-editor
Chat to edit. Type what you want changed.

