A Google DeepMind AI language model is now making descriptions for YouTube Shorts


Google just combined DeepMind and Google Brain into one big AI team, and on Wednesday, the new Google DeepMind shared details on how one of its visual language models (VLM) is being used to generate descriptions for YouTube Shorts, which can help with discoverability.
“Shorts are created in just a few minutes and often don’t include descriptions and helpful titles, which makes them harder to find through search,” DeepMind wrote in the post. The model, called Flamingo, can generate those descriptions by analyzing the initial frames of a video to explain what’s going on. (DeepMind gives the example of “a dog balancing a stack of crackers on its head.”) The text descriptions will be stored as metadata to “better categorize videos and match search results to viewer queries.”
This solves a real problem, Google DeepMind’s chief business officer Colin Murdoch tells The Verge: for Shorts, creators sometimes don’t add metadata because the process of creating a video is more streamlined than it is for a longer-form video. Todd Sherman, the director of product management for Shorts, added that because Shorts are mostly watched on a feed where people are just swiping to the next video instead of actively browsing for them, there isn’t as much incentive to add the metadata.
“This Flamingo model — the ability to understand these videos and provide us descriptive text — is just really so valuable for helping our systems that are already looking for this metadata,” Sherman says. “It allows them to more effectively understand these videos so that we can make that match for users when they’re searching for them.”
The generated descriptions won’t be user-facing. “We’re talking about metadata that’s behind the scenes,” Sherman says. “We don’t present it to creators, but there’s a lot of effort going into making sure that it’s accurate.” As for how Google is making sure these descriptions are accurate, “all of the descriptive text is going to align with our responsibility standards,” Sherman says. “It’s very unlikely that a descriptive text is generated that somehow frames a video in a bad light. That’s not an outcome that we anticipate at all.”
Flamingo is already applying auto-generated descriptions to new Shorts uploads, and it has done so for “a large corpus of existing videos, including the most viewed videos,” according to DeepMind spokesperson Duncan Smith.
I had to ask if Flamingo would be applied to longer-form YouTube videos down the line. “I think it’s completely conceivable that it could,” Sherman says. “I think that the need is probably a little bit less, though.” He notes that for a longer-form video, a creator might spend hours on things like pre-production, filming, and editing, so adding metadata is a relatively small piece of the process of making a video. And because people often watch longer-form videos based on things like a title and a thumbnail, creators making those have incentive to add metadata that helps with discoverability.
So I guess the answer there is that we’ll have to wait and see. But given Google’s major push to infuse AI into nearly everything it offers, applying something like Flamingo to longer-form YouTube videos doesn’t feel outside the realm of possibility, and that could have a huge impact on YouTube search in the future.