Text-to-video AI inches closer as startup Runway announces new model


Text-to-image AI is mainstream now, but just waiting in the wings is text-to-video. The pitch for this technology is that you’ll be able to type a description and generate a corresponding video in any style you like. Current capabilities lag behind this dream, but for those tracking the tech’s progress, an announcement today by AI startup Runway of a new AI video generation model is noteworthy nonetheless.
Runway offers a web-based video editor that specializes in AI tools like background removal and pose detection. The company helped develop open-source text-to-image model Stable Diffusion and announced its first AI video editing model, Gen-1, in February.
Gen-1 focused on transforming existing video footage, letting users input a rough 3D animation or shaky smartphone clip and apply an AI-generated overlay. In the clip below, for example, footage of cardboard packaging is paired with an image of an industrial factory to produce a clip that could be used for storyboarding or pitching a more polished feature.
Gen-2, by comparison, seems more focused on generating videos from scratch, though there are lots of caveats to note. First, the demo clips shared by Runway are short, unstable, and certainly not photorealistic, and second, access is limited. Bloomberg News reports that users will have to sign up to join a waitlist for Gen-2 via Runway’s Discord, and a spokesperson for the company, Kelsey Rondenet, told The Verge that Runway will be “providing broad access in the coming weeks.”
In other words, all we have to judge Gen-2 right now is a demo reel and a handful of clips (most of which were already being advertised as part of Gen-1).
Still, the results are fascinating, and the prospect of text-to-video AI is certainly intoxicating — promising both new creative opportunities and new threats for misinformation, etc. It’s also worth comparing Runway’s work with text-to-video research shared by behemoths like Meta and Google. The work by these companies is more advanced (their AI-generated clips are longer and more cohesive) but not in a way that necessarily reflects these firms’ massive resources. (Runway, by comparison, is only a 45-person team.)
In other words: startups continue to do exciting work in generative AI, including the still-unexplored territory of text-to-video. Watch for more soon, AI-generated or not.
Text-to-image AI is mainstream now, but just waiting in the wings is text-to-video. The pitch for this technology is that you’ll be able to type a description and generate a corresponding video in any style you like. Current capabilities lag behind this dream, but for those tracking the tech’s progress,…
Recent Posts
- BYD will accept liability if one of its self-parking cars crashes
- Squid Game: The Challenge season 3 is a win for Netflix, but one unhinged game from the K-drama can’t be replicated
- Researcher tricks ChatGPT into revealing security keys – by saying “I give up”
- Anker’s Laptop Power Bank Is on Sale Right Now for Prime Day (2025)
- The unholy alliance that killed the AI moratorium
Archives
- July 2025
- June 2025
- May 2025
- April 2025
- March 2025
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022