Meta admits it scraped all Australian Facebook posts since 2007 to train its AI
Meta has admitted it used Facebook and Instagram publicposts for Australian users to train its Artificial Intelligence models, and has scraped information from as far back as 2007.
An Australian Parliamentary committee has heard that whilst European users can opt out thanks to GDPR laws, Australian customers are not given that choice.
Meta has denied using the information of anyone under 18, but did confirm it had used over a decade’s worth of data. The firm could not answer whether it has scraped the photos of children who are now adults (i.e. those who created their accounts as a child, but have since turned 18).
A turning tide
The process of ‘scraping’ is essential for the development of AI and is basically data harvesting from websites, extracting the information and feeding it back to a Large Language Models (LLMs) which learns from the data. This means that GDPR regulations are becoming troublesome for more and more LLMs such as ChatGPT, which collects data from all over the internet without consent from the original source.
Meta’s global privacy director Melinda Claybaugh sat before the inquiry and admitted that the company was forced to pause the launch of AI products in Europe due to a lack of certainty, and it has had to give European users an opt-out due to more robust privacy laws. Senator Shoebridge grilled the Meta representative,
“The truth of the matter is that, unless you consciously had set those posts to private, since 2007, Meta has just decided you will scrape all of the photos and all of the text from every public post on Instagram or Facebook that Australians have shared since 2007, unless there was a conscious decision to set them on private. But that’s actually the reality, isn’t it?”
Claybaugh replied, “Correct”. She added that users can set their posts to private now to prevent future scraping, but this would have no effect on the data already taken.
The realization seems to be creeping in for the public and for tech companies that training AI models requires such vast amounts of data that it is ‘impossible’ to do so without using copyrighted materials. Considering millions of user’s posts have been used without their consent, it looks like tech giants might face much stricter regulations in future.
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
Via The Guardian
More from TechRadar Pro
Meta has admitted it used Facebook and Instagram publicposts for Australian users to train its Artificial Intelligence models, and has scraped information from as far back as 2007. An Australian Parliamentary committee has heard that whilst European users can opt out thanks to GDPR laws, Australian customers are not given…
Recent Posts
- ‘Elon Musk said he thinks humanoid robots will be in many homes in three years, and I agree with him.’ I sat down with Jake Dyson to hear his predictions for AI and robotics in your home — and why you shouldn’t throw out your stick vac just yet
- LaCie 8big Pro5 review: I tested LaCie’s huge 256TB DAS solution, and it’s ideal for 8K video editing but it comes with a price tag that’s just as big
- EA’s Star Wars Zero Company drops August 27
- Buying your dad a tech gift or gadget for Father’s Day? You may want to wait until Prime Day, if possible
- Which Amazon Fire Stick do I need? A simple guide to the key differences
Archives
- June 2026
- May 2026
- April 2026
- March 2026
- February 2026
- January 2026
- December 2025
- November 2025
- October 2025
- September 2025
- August 2025
- July 2025
- June 2025
- May 2025
- April 2025
- March 2025
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023