Home / Training Models / Phenaki
AI training-models

Phenaki
Free
AI Coding Tools
Text-to-video AI with creative, realistic synthesis
Phenaki screenShots

Overview of Phenaki
FAQs Of Phenaki
Phenaki is a model that can generate realistic videos from textual descriptions.
Phenaki can generate coherent long-form visual stories from a chain of prompts, with a core resolution of 128x128 pixels.
Phenaki addresses the challenges of generating videos from text by using two main components: an encoder-decoder model and a transformer model.
Phenaki uses causal attention in time for both the video encoder-decoder and the text encoder, which allows it to work with variable-length inputs and outputs.
Phenaki consists of two main components: an encoder-decoder model that compresses videos to discrete tokens, and a transformer model that translates text tokens to video tokens.
Phenaki can handle open-domain prompts that can change over time, such as “A teddy bear swimming in the ocean” or “An astronaut dancing on Mars”. Phenaki uses a bi-directional masked transformer to generate video tokens from text tokens, which can capture the temporal and semantic dependencies between the prompts.