Microsoft backed a tiny hardware startup that just launched its first AI processor that does inference without GPU or expensive HBM memory and a key Nvidia partner is collaborating with it


- Microsoft-backed startup introduces GPU-free alternatives for generative AI
- DIMC architecture delivers an ultra-high memory bandwidth of 150 TB/s
- Corsair supports transformers, agentic AI, and interactive video generation
d-Matrix Inc., a hardware startup based in Santa Clara, California, has introduced its first AI processor, Corsair, which is aimed at enhancing AI inference.
Backed by Microsoft and leveraging cutting-edge technology, Corsair eschews traditional GPUs and expensive high-bandwidth memory (HBM), delivering significant performance and cost benefits.
Corsair is currently available to early-access customers, with broader availability planned for the second quarter of 2025.
Corsair’s performance redefines AI inference
The Corsair processor is purpose-built to handle demanding AI inference tasks, particularly for generative AI models. For example, it achieves 60,000 tokens per second at 1 ms per token when running Llama3 8B in a single server.
In more resource-intensive scenarios, such as with Llama3 70B models, Corsair delivers 30,000 tokens per second at 2 ms per token in a single rack, translating into substantial savings in energy and operational costs compared to traditional GPU-based solutions.
The processor is built on Nighthawk and Jayhawk II tiles, using a 6nm manufacturing process. Each Nighthawk tile integrates four neural cores and a RISC-V CPU, tailored to support large-model inference with digital in-memory computation (DIMC) and versatile datatype processing, including block floating point (BFP).
Corsair adopts chiplet packaging, integrating memory and computation to maximize efficiency. It conforms to the industry-standard PCIe Gen5 full height full-length card form factor and can be paired with DMX Bridge cards for scalable performance. Each card is powered by 2400 TFLOPs of 8-bit peak computing, along with 2GB of integrated performance memory and up to 256GB of off-chip memory capacity.
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
It is important to note that Micron Technology, a key partner of Nvidia, is also collaborating with d-Matrix.
Initially set to launch in late 2023, d-Matrix reconfigured its architecture in response to the surging demand for generative AI. This pivot allowed Corsair to incorporate enhancements tailored for transformer models and emerging applications like agentic AI and interactive video generation.
“We saw transformers and generative AI coming, and founded d-Matrix to address inference challenges around the largest computing opportunity of our time,” said Sid Sheth, cofounder and CEO of d-Matrix.
“The first-of-its-kind Corsair compute platform brings blazing fast token generation for high interactivity applications with multiple users, making Gen AI commercially viable,” Sheth added.
Via eeNews
You may also like
Microsoft-backed startup introduces GPU-free alternatives for generative AI DIMC architecture delivers an ultra-high memory bandwidth of 150 TB/s Corsair supports transformers, agentic AI, and interactive video generation d-Matrix Inc., a hardware startup based in Santa Clara, California, has introduced its first AI processor, Corsair, which is aimed at enhancing AI…
Recent Posts
- The shape of things to come? Nvidia’s super fast 800GBps SuperNIC card spied and this Connect X-8 AIB vaguely resembles a GPU
- Two AI chatbots speaking to each other in their own special language is the last thing we need
- Samsung’s 9100 PRO SSD line includes its first 8TB NVMe model for consumers
- Sonos speakers and soundbars are 25 percent off for existing customers
- Xbox Cloud Gaming will let you invite friends with just a link
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010