text2video-zero

About

Text2Video-Zero uses a pre-trained text-to-image diffusion model to generate video sequences from text descriptions. It works zero-shot: no video-specific training or fine-tuning is required. Instead, it adds motion dynamics to the latent codes of successive frames and uses cross-frame attention, in which every frame attends to the first frame, to keep the subject and background consistent over time. It is well suited to rapid prototyping of video concepts from written prompts.
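To make the cross-frame attention idea concrete, here is a minimal pure-Python sketch. It assumes each frame is already represented as small lists of query, key, and value vectors; the function names (`attend`, `cross_frame_attention`) are hypothetical and this is not the model's actual implementation, only an illustration of every frame attending to the first frame's keys and values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys, values):
    """Scaled dot-product attention for one frame (lists of vectors)."""
    dim = len(keys[0])
    out = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim)
            for k in keys
        ]
        weights = softmax(scores)
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def cross_frame_attention(frames_q, frames_k, frames_v):
    """Each frame's queries attend to the FIRST frame's keys and values.

    Ordinary self-attention would use frames_k[i] and frames_v[i] for
    frame i; using index 0 for every frame is what ties later frames
    to the appearance of the first one.
    """
    anchor_k, anchor_v = frames_k[0], frames_v[0]
    return [attend(q, anchor_k, anchor_v) for q in frames_q]
```

Because the keys and values of later frames are never read, whatever appearance information the first frame carries is what every frame draws on, which is the intuition behind the temporal consistency.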

Perks

High quality
High-fidelity output suitable for professional use.
9:16 vertical
Native vertical output for social platforms.

Settings

Negative Prompt
Video Length: length of the video in seconds
FPS
Motion X
Motion Y
Seed
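
As a rough illustration of how these settings might fit together, the sketch below groups them in a small Python structure and derives two quantities from them: the number of frames (length in seconds times FPS) and a per-frame translation offset that grows linearly with the frame index. Everything here is an assumption for illustration: the `VideoSettings` class, its field names and default values, and the units of Motion X and Motion Y are hypothetical, since the page does not specify them.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class VideoSettings:
    """Hypothetical container mirroring the settings listed above."""
    negative_prompt: str = ""      # things the video should avoid
    video_length_s: float = 2.0    # video length in seconds (assumed default)
    fps: int = 4                   # frames per second (assumed default)
    motion_x: float = 12.0         # horizontal motion strength (units assumed)
    motion_y: float = 12.0         # vertical motion strength (units assumed)
    seed: Optional[int] = None     # fixed seed for repeatable output


def frame_count(settings: VideoSettings) -> int:
    """Frames to generate: seconds times frames per second, at least 1."""
    return max(1, round(settings.video_length_s * settings.fps))


def frame_offsets(settings: VideoSettings) -> List[Tuple[float, float]]:
    """Assumed linear global translation: frame k is shifted k times the
    per-frame motion, so frame 0 is the unshifted reference frame."""
    return [
        (k * settings.motion_x, k * settings.motion_y)
        for k in range(frame_count(settings))
    ]
```

Under these assumptions, raising Motion X or Motion Y makes each successive frame drift further from the first, while Video Length and FPS together decide how many frames have to be generated.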