HBM-on-GPU set to power the next revolution in AI accelerators – and just to confirm, there’s no way this will come to your video card anytime soon
- 3D HBM-on-GPU design reaches record compute density for demanding AI workloads
- Peak GPU temperatures exceeded 140°C without thermal mitigation strategies
- Halving the GPU clock rate reduced temperatures but slowed AI training by 28%
Imec presented an examination of a 3D HBM-on-GPU design aimed at increasing compute density for demanding AI workloads at the 2025 IEEE International Electron Devices Meeting (IEDM).
The thermal system-technology co-optimization approach places four high-bandwidth memory stacks directly above a GPU through microbump connections.
Each stack consists of twelve hybrid-bonded DRAM dies, and cooling is applied on top of the HBMs.
Thermal mitigation attempts and performance trade-offs
The solution applies power maps derived from industry-relevant workloads to test how the configuration responds under realistic AI training conditions.
This 3D arrangement promises a leap in compute density and memory per GPU.
It also offers higher GPU memory bandwidth compared to 2.5D integration, where HBM stacks sit around the GPU on a silicon interposer.
However, the thermal simulations reveal severe challenges for the 3D HBM-on-GPU design.
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
Without mitigation, peak GPU temperatures reached 141.7°C, far above operational limits, while the 2.5D baseline peaked at 69.1°C under the same cooling conditions.
Imec explored technology-level strategies such as merging HBM stacks and thermal silicon optimization.
System-level strategies included double-sided cooling and GPU frequency scaling.
Reducing the GPU clock rate by 50% lowered peak temperatures to below 100°C, but this change slowed AI training workloads.
Despite these limitations, Imec argues that the 3D structure can deliver higher compute density and performance than the 2.5D reference design.
“Halving the GPU core frequency brought the peak temperature from 120°C to below 100°C, achieving a key target for the memory operation. Although this step comes with a 28% workload penalty…” said James Myers, System Technology Program Director at Imec.
“…the overall package outperforms the 2.5D baseline thanks to a higher throughput density offered by the 3D configuration. We are currently using this approach to study other GPU and HBM configurations…”
The organization suggests this approach could support thermally resilient hardware for AI tools in dense data centers.
Imec presents this work as part of a broader effort to link technology decisions with system behavior.
This includes the cross-technology co-optimization (XTCO) program, launched in 2025, which combines STCO and DTCO mindsets to align technology roadmaps with system scaling challenges.
Imec said that XTCO enables collaborative problem-solving for critical bottlenecks across the semiconductor ecosystem, including fabless and system companies.
However, such technologies will likely remain confined to specialized facilities with controlled power and thermal budgets.
Via TechPowerUp
Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!
And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.
3D HBM-on-GPU design reaches record compute density for demanding AI workloads Peak GPU temperatures exceeded 140°C without thermal mitigation strategies Halving the GPU clock rate reduced temperatures but slowed AI training by 28% Imec presented an examination of a 3D HBM-on-GPU design aimed at increasing compute density for demanding AI…
Recent Posts
- This chunky little tablet got my kid to clean up his toys
- OpenAI will let the US government review its AI models before release
- Seagate FireCuda X Vault review: Large capacity and decent transfer rates make this external hard drive a great solution for video and photography
- I customized a MacBook Neo with colorful spare parts
- EveryPlate Meal Kit Review (2026): Low Cost, Simplicity, Flavor
Archives
- June 2026
- May 2026
- April 2026
- March 2026
- February 2026
- January 2026
- December 2025
- November 2025
- October 2025
- September 2025
- August 2025
- July 2025
- June 2025
- May 2025
- April 2025
- March 2025
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023