Meta's AI video generator tool is already giving me nightmares
Meta's giving us more efficient video generation to bring motion to latent space.
Keep up to date with the most important stories and the best deals, as picked by the PC Gamer team.
You are now subscribed
Your newsletter sign-up was successful
Want to add more newsletters?
Every Friday
GamesRadar+
Your weekly update on everything you could ever want to know about the games you already love, games we know you're going to love in the near future, and tales from the communities that surround them.
Every Thursday
GTA 6 O'clock
Our special GTA 6 newsletter, with breaking news, insider info, and rumor analysis from the award-winning GTA 6 O'clock experts.
Every Friday
Knowledge
From the creators of Edge: A weekly videogame industry newsletter with analysis from expert writers, guidance from professionals, and insight into what's on the horizon.
Every Thursday
The Setup
Hardware nerds unite, sign up to our free tech newsletter for a weekly digest of the hottest new tech, the latest gadgets on the test bench, and much more.
Every Wednesday
Switch 2 Spotlight
Sign up to our new Switch 2 newsletter, where we bring you the latest talking points on Nintendo's new console each week, bring you up to date on the news, and recommend what games to play.
Every Saturday
The Watchlist
Subscribe for a weekly digest of the movie and TV news that matters, direct to your inbox. From first-look trailers, interviews, reviews and explainers, we've got you covered.
Once a month
SFX
Get sneak previews, exclusive competitions and details of special events each month!
Meta is offering an AI video generation service via Twitter right now called Make-A-Video. Although it looks pretty horrendous right now, the number of comments in just a day suggests that soon the AI image generation fad will be superseded by AI video generation. It's a big leap, with researchers pushing the boundaries of generative art as we know it, in particular how much data is necessary to bring images to life.
"With just a few words, this state-of-the-art AI system generates high-quality videos from text prompts," Meta AI writes in the tweet, and calls for prompts. The trick to keeping heaps of unregulated gore and porn from being generated and posted on Twitter? Send the prompt to them, and they might post the results.
We’re pleased to introduce Make-A-Video, our latest in #GenerativeAI research! With just a few words, this state-of-the-art AI system generates high-quality videos from text prompts.Have an idea you want to see? Reply w/ your prompt using #MetaAI and we’ll share more results. pic.twitter.com/q8zjiwLBjbSeptember 29, 2022
The alternative to waiting for the (likely scarred for life) Meta AI team to potentially select your prompt out of the thousands now piling into the comments is to head over to the Make-A-Video studio and sign up using the Google form to register your interest in the tool.
The accompanying research paper (PDF warning) calls the Make-A-Video process "an effective method that extends a diffusion-based T2I model to T2V through a spatiotemporally factorized diffusion model." That's a fancy way of saying they used an evolved version of diffusion's Text-to-Image generation model to make pictures move.
"While there is remarkable progress in T2I generation," the paper reads, "the progress of T2V generation lags behind largely due to two main reasons: the lack of large-scale datasets with high-quality text-video pairs, and the complexity of modelling higher-dimensional video data."
Best gaming monitor: Pixel-perfect panels for your PC
Best high refresh rate monitor: Screaming quick screens
Best 4K monitor for gaming: When only high-res will do
Best 4K TV for gaming: Big-screen 4K PC gaming
Essentially, the size and accuracy of the datasets needed to train current text to video AI models are just too vast to be viable.
The amazing thing about this evolution is that "it does not require paired text-video data," the paper notes. That's unlike many video and image generators out there that rely on galleries of content already paired with text. "This is a significant advantage compared to prior work," it explains, as it isn't as restricted and doesn't require as much data in order to work.
Keep up to date with the most important stories and the best deals, as picked by the PC Gamer team.
There are a few ways to use the tool, with it either filling in the motion between two images, simply adding motion to a single image, or creating new variations of a video based on the original. The results are fascinating. They're dreamy and psychedelic, and can be generated in a few different styles.
Sure these are a little spooky, especially when you remember that the results are only going to get more realistic, but a little hike through uncanny valley never hurts on the lead up to Halloween.

Having been obsessed with game mechanics, computers and graphics for three decades, Katie took Game Art and Design up to Masters level at uni and has been writing about digital games, tabletop games and gaming technology for over five years since. She can be found facilitating board game design workshops and optimising everything in her path.

