Tumblr’s owner is striking deals with OpenAI and Midjourney for training data, says report
The owner of Tumblr and WordPress.com is in talks with AI companies Midjourney and OpenAI to provide training data scraped from users’ posts, a report from 404 Media alleges. The report, based on an anonymous source inside the company, says that deals between Automattic and the two AI companies are “imminent.” It follows nebulous rumors that have spread on Tumblr over the past week, suggesting a deal with Midjourney could provide a new revenue stream for the site.
According to 404’s report, Automattic plans to launch a new setting Wednesday that will “allow users to opt-out of data sharing with third parties, including AI companies.” But it cites internal posts that suggest the company scraped an “initial data dump” containing “all Tumblr’s public post content between 2014 and 2023,” including — apparently by mistake — content that wouldn’t be publicly visible on blogs. It’s unclear what was done with this data and what data (if any) has been sent to Midjourney and OpenAI.
OpenAI and Midjourney did not immediately respond to requests for comment from The Verge. Automattic directed us to a public statement it published on Tuesday following 404’s report. The post, titled “Protecting User Choice,” alludes to partnerships with unnamed AI companies. “We currently block, by default, major AI platform crawlers — including ones from the biggest tech companies — and update our lists as new ones launch,” it says, and “will share only public content that’s hosted on WordPress.com and Tumblr from sites that haven’t opted out.” It goes on to note that “we are also working directly with select AI companies as long as their plans align with what our community cares about: attribution, opt-outs, and control.”
A number of companies have struck deals with AI tool makers to provide training data — which has historically been scraped from publicly available online data, a process that’s become legally riskier in recent years. Reddit reportedly has a $60 million annual deal with Google, while Shutterstock has signed a deal with OpenAI to train on its photo library. But a number of artists and writers — in other words, the creative community that Tumblr in particular caters to — have protested their work being used for training. Companies have struggled to walk a line between satisfying users and experimenting with new AI tools, leading to backlash against online spaces like DeviantArt that have flirted with the tech.
For now, there’s not much information about what any deal would entail, nor how much Automattic stands to gain from it. The company has a long-standing web hosting business with WordPress.com and WordPress VIP, both built on the open-source WordPress software. But it has struggled with a variety of methods for monetizing Tumblr, which it acquired from Verizon in 2019, and announced last year that it would downscale its ambitions for the site.
Update 3:50PM ET: Added statement from Automattic.