Yandex caught scraping Google SEO code


As TechRadar Pro reported earlier in January 2023, a former Yandex employee with a “political” motive has allegedly leaked a wide-ranging repository of source code for many of the web portal’s products, potentially shedding light on the dark art of search engine optimization.
BleepingComputer (opens in new tab) reports the employee leaked git sources totalling 44.7GB of files, containing “all of” Yandex’s source code except for its anti-spam rules, that were obtained in July 2022.
The raw source code won’t be of interest to everyone, Search Engine Land (opens in new tab)‘s report that 17,854 search ranking factors have been uncovered as part of the leak should be of interest to any person, business or publication looking to see their pages ranked highly in search engines.
Yandex leak SEO insights
A partial list of factors ranked by the Yandex search engine from one file in the codebase, shared by CEO of SEO consultancy MOG Media Martin MacDonald, does shed some light on the aspects of copy that Yandex applies weight to.
Per Russian Search News (opens in new tab), these include PageRank and several aspects of links such as age and relevancy, the perceived relevance of copy, host-reilability, and innate preferences towards specific sites with perceived authority, such as Wikipedia.
A deeper, longer, more technical dive by Search Engine Land (opens in new tab) also shows that this priority also includes a “NEWS_AGENCY_RATING”, allowing Yandex’ search engine to show preference to certain news organizations.
Others include the number of unique visitors, percentages of organic traffic, and average domain rankings across queries.
However, it’s perhaps melodramatic, or a little desolate, for MacDonald to describe it as “the most interesting thing to have happened in SEO in years.”
While the leaked codebase certainly offers a raft of insights, it’s worth noting that many websites will be looking to rank well on Google over Yandex, purely because the former is far better known.
Both companies have shared web engineers over the years, Yandex does use many of Google’s open source technologies, such as TensorFlow and BERT, and references to Google data appear in the leaked codebase.
However, Search Engine Land’s deep dive argues that the Yandex leak can give general insight into the anatomy of a modern search engine, but, per Russian Search News, many of the Yandex’ leaked ranking search factors go unused, or are officially considered depreciated.
Even the technical deep dive admits many of Google (the search engine’s) known aspects, such as its crawler and index systems, differ from Yandex’.
All of this, combined with the age of the leaked codebase, makes it unclear as to how assumptions over how Yandex and Google may both rank pages will fare.
Audio player loading… As TechRadar Pro reported earlier in January 2023, a former Yandex employee with a “political” motive has allegedly leaked a wide-ranging repository of source code for many of the web portal’s products, potentially shedding light on the dark art of search engine optimization. BleepingComputer (opens in new…
Recent Posts
- Apple’s C1 chip could be a big deal for iPhones – here’s why
- Rabbit shows off the AI agent it should have launched with
- Instagram wants you to do more with DMs than just slide into someone else’s
- Nvidia is launching ‘priority access’ to help fans buy RTX 5080 and 5090 FE GPUs
- HPE launches slew of Xeon-based Proliant servers which claim to be impervious to quantum computing threats
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010