Nvidia out? DeepSeek pairs with banned Chinese tech giant to deliver unbelievably low pricing on AI inference, which could bring Nvidia’s house of cards crashing down


- DeepSeek’s V3 and R1 models are available through Huawei’s Ascend cloud service
- They are powered by the Ascend 910x accelerators banned in the US, EU and UK
- The pricing is much lower than that offered by Azure and AWS, which have started trialing DeepSeek
DeepSeek recently unsettled global markets with the launch of its open reasoning LLM, which was built and trained for a fraction of the cost of models from much larger US competitors. OpenAI has since accused DeepSeek’s developers of using its models to train theirs.
A recent paper claimed DeepSeek’s V3 LLM was trained on a cluster of just 2,048 Nvidia H800 GPUs, cut-down versions of the H100 designed to comply with US export restrictions to China. Rumors around DeepSeek’s newer reasoning model, R1, suggest it may have been trained on as many as 50,000 Nvidia “Hopper” GPUs, including H100, H800, and the newer H20, although DeepSeek hasn’t confirmed this and likely won’t. If true, that would raise serious questions about China’s access to advanced AI hardware despite ongoing trade restrictions, although it’s no secret there’s a thriving black market for advanced Nvidia AI hardware there.
Now, in a move that’s going to further shake Western firms, the South China Morning Post reports Huawei Technologies’ cloud computing unit has partnered with Beijing-based AI infrastructure start-up SiliconFlow to make DeepSeek’s models available to end users for an incredibly low price.
Powered by Huawei hardware
This collaboration, which was worked on during the Chinese Lunar New Year holidays, provides efficient, cost-effective access to DeepSeek’s V3 and R1 models through Huawei’s Ascend cloud service. The service runs on Huawei’s own homegrown hardware, including the controversial Ascend 910x accelerators, which are banned in the US, UK and Europe.
Huawei has made no secret that it wants to become the Chinese Nvidia, and Huawei Cloud claims the models’ performance on its service is comparable to that of models running on premium global GPUs.
SiliconFlow, which hosts the DeepSeek models, has come out swinging with aggressive pricing: V3 costs 1 yuan (approximately US$0.13) per 1 million input tokens and 2 yuan per 1 million output tokens, while R1 access is priced at 4 yuan and 16 yuan respectively.
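To put those per-token prices in context, here is a minimal Python sketch that estimates the bill for a given workload. It assumes the R1 figures of 4 yuan and 16 yuan refer to input and output tokens respectively, and it reuses the approximate conversion of 1 yuan to US$0.13 quoted above. The `estimate_cost` helper and the example token counts are hypothetical and are not part of any SiliconFlow or Huawei API.

```python
# Prices in yuan per 1 million tokens, as quoted in the article.
# Assumption: R1's "4 yuan and 16 yuan" map to input and output respectively.
PRICES_YUAN_PER_MILLION = {
    "V3": {"input": 1.0, "output": 2.0},
    "R1": {"input": 4.0, "output": 16.0},
}

# Assumption: the article's approximate rate of 1 yuan to US$0.13.
USD_PER_YUAN = 0.13


def estimate_cost(model, input_tokens, output_tokens):
    """Return (cost_in_yuan, cost_in_usd) for a hypothetical workload."""
    price = PRICES_YUAN_PER_MILLION[model]
    cost_yuan = (
        input_tokens * price["input"] + output_tokens * price["output"]
    ) / 1_000_000
    return cost_yuan, cost_yuan * USD_PER_YUAN


if __name__ == "__main__":
    # Hypothetical workload: 10 million input tokens, 2 million output tokens.
    for model in PRICES_YUAN_PER_MILLION:
        yuan, usd = estimate_cost(model, 10_000_000, 2_000_000)
        print(f"{model}: {yuan:.2f} yuan (about US${usd:.2f})")
```

Under these assumptions, the example workload comes to 14 yuan (about US$1.82) on V3 and 72 yuan (about US$9.36) on R1.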
Microsoft added DeepSeek to its Azure AI Foundry a few days ago, and Amazon swiftly followed suit, adding the LLM to its AWS Bedrock managed service. AWS showcased the AI model using an ml.p5e.48xlarge instance, powered by eight Nvidia H200 GPUs delivering 1,128GB of GPU memory. It’s early days for both cloud offerings, though, and they work out much more expensive than SiliconFlow’s super-low pricing.
The collaboration between Huawei, SiliconFlow and DeepSeek highlights China’s broader strategy to strengthen its domestic AI capabilities while reducing reliance on Nvidia hardware.
The South China Morning Post notes, “The move to launch DeepSeek’s models on a homegrown hardware backbone highlights China’s progress in cutting dependency on foreign technology and bolstering its domestic AI industry amid growing efforts by the US to choke off China’s access to high-end chips that the US government said could be used to advance military aims.”