It's getting harder to know what's real on the Internet, and Google is not helping one bit with the announcement of Veo 3.1. The company's new video model allegedly offers better audio and realism, along with greater prompt accuracy. The updated video AI will be available throughout the Google ecosystem, including the Flow filmmaking tool, where the new model will unlock additional features. And if you're worried about the cost of conjuring all these AI videos, Google is also adding a "Fast" variant of Veo.
Veo made waves when it debuted earlier this year, demonstrating a staggering improvement in AI video quality just a few months after Veo 2's release. It turns out that having all that video on YouTube is very useful for training AI models, so Google is already moving on to Veo 3.1 with a raft of new features.
Google says Veo 3.1 offers stronger prompt adherence, which results in better video outputs and fewer wasted compute cycles. Audio, which was a hallmark feature of the Veo 3 release, has allegedly gotten better, too. Veo 3's text-to-video was limited to 720p landscape output, but there's an ever-increasing volume of vertical video on the Internet. So Veo 3.1 can produce both landscape and portrait 16:9 video.
Google previously said it would bring Veo video tools to YouTube Shorts, which use a vertical video format like TikTok. The release of Veo 3.1 probably opens the door to fulfilling that promise. You can bet Veo videos will show up more frequently on TikTok as well now that it fits the format. This release also keeps Google in its race with OpenAI, which recently released a Sora iPhone app with an impressive new version of its video-generating AI.
A focus on filmmakers
The Veo 3.1 model will be available across Google's AI ecosystem. You'll be able to create content with Veo 3.1 and Veo 3.1 Fast via the Gemini app, and developers will have access in Vertex AI and through the Gemini API. Using the Fast variant will help keep costs down when paying per token. Presumably, users of the Gemini app will get more Fast video generations—we've asked Google about limits and will report if we hear back.
Veo is the underlying model in Google's Flow filmmaking tool, and it's getting a few new capabilities thanks to the updated model. The Ingredients to Video, Frames to Video, and Extend features are now all compatible with generated audio. So you can upload multiple images as a reference or use images as a starting or end point while also adding custom audio to the clip. These same capabilities are offered in the API, and the Gemini app continues to accept reference images for Veo outputs. The app doesn't get all the Flow features, though.
There are a couple of entirely new video features coming with Veo 3.1, too. Google says Veo 3.1 is better able to replicate the look of a video while making "precision" edits. So you'll be able to add an object to a clip while keeping the rest of it unchanged (more or less). Likewise, you can remove an element without changing the rest of the scene. Adding objects will be available in Flow and the API immediately. Removing objects won't be available in Flow just yet, but Google says that will be coming soon.
The new video model begins rolling out today, so make sure you use a skeptical eye when scrolling vertical videos.