The Next Platform
  • Home
  • Compute
  • Store
  • Connect
  • Control
  • Code
  • AI
  • HPC
  • Enterprise
  • Hyperscale
  • Cloud
  • Edge
Latest
  • [ January 27, 2025 ] How Did DeepSeek Train Its AI Model On A Lot Less – And Crippled – Hardware? AI
  • [ January 24, 2025 ] Brad McCredie Is The Pedal To AMD’s Datacenter GPU Metal Compute
  • [ January 23, 2025 ] GenAI Boom: Datacenter Spending Forecast Raised Again Compute
  • [ January 23, 2025 ] How Enterprise AI Can Ease The Data Gravity Burden Store
  • [ January 22, 2025 ] OpenAI Declares Its Hardware Independence (Sort Of) With Stargate Project AI
  • [ January 21, 2025 ] HLRS Takes First Steps To Exascale HPC
  • [ January 16, 2025 ] TSMC Can’t Be Caught Or Bought, Only Sought Or Stolen Compute
  • [ January 16, 2025 ] Using NIM Guardrails To Keep Agentic AI From Jumping To Wrong Conclusions AI
Homeinference

inference

AI

Cerebras Trains Llama Models To Leap Over GPUs

October 25, 2024 Timothy Prickett Morgan 11

It was only a few months ago when waferscale compute pioneer Cerebras Systems was bragging that a handful of its WSE-3 engines lashed together could run circles around Nvidia GPU instances based on Nvidia’s “Hopper” H100 GPUs when running the open source Llama 3.1 foundation model created by Meta Platforms. …

AI

Cerebras Needs Wall Street To Expand Beyond One Core Customer

October 2, 2024 Timothy Prickett Morgan 5

Waferscale compute engine and AI system maker Cerebras Systems has filed with the US Securities and Exchange Commission to sell a chunk of itself to the public, giving we outsiders a view of the past two and a half years of its internal financials. …

AI

The Battle Begins For AI Inference Compute In The Datacenter

September 10, 2024 Timothy Prickett Morgan 3

The major cloud builders and their hyperscaler brethren – in many cases, one company acts like both a cloud and a hyperscaler – have made their technology choices when it comes to deploying AI training platforms. …

Compute

The First AI Benchmarks Pitting AMD Against Nvidia

September 3, 2024 Timothy Prickett Morgan 4

Rated horsepower for a compute engine is an interesting intellectual exercise, but it is where the rubber hits the road that really matters. …

AI

Stacking Up Intel Gaudi Against Nvidia GPUs For AI

June 13, 2024 Timothy Prickett Morgan 12

Updated: Here is something we don’t see much anymore when it comes to AI systems: list prices for the accelerators and the base motherboards that glue a bunch of them together into a shared compute complex. …

AI

Talking AI Costs And Addressable Markets With SambaNova

February 14, 2024 Timothy Prickett Morgan 1

The only way to accurately predict the future is to live it, but just the same, prognostication is one of the things that we humans love to do. …

AI

How AWS Can Undercut Nvidia With Homegrown AI Compute Engines

December 4, 2023 Timothy Prickett Morgan 1

Amazon Web Services may not be the first of the hyperscalers and cloud builders to create its own custom compute engines, but it has been hot on the heels of Google, which started using its homegrown TPU accelerators for AI workloads in 2015. …

AI

Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years

November 27, 2023 Timothy Prickett Morgan 2

If you are looking for an alternative to Nvidia GPUs for AI inference – and who isn’t these days with generative AI being the hottest thing since a volcanic eruption – then you might want to give Groq a call. …

AI

Big Blue Can Still Catch The AI Wave If It Hurries

November 6, 2023 Timothy Prickett Morgan 5

It has been two and a half decades since we have seen a rapidly expanding universe of a new kind of compute that rivals the current generative AI boom. …

AI

Optimizing AI Inference Is As Vital As Building AI Training Beasts

September 11, 2023 Timothy Prickett Morgan 8

The history of computing teaches us that software always and necessarily lags hardware, and unfortunately that lag can stretch for many years when it comes to wringing the best performance out of iron by tweaking algorithms. …

Posts pagination

1 2 … 4 »
About

The Next Platform is part of the Situation Publishing family, which includes the enterprise and business technology publication, The Register.

TNP  offers in-depth coverage of high-end computing at large enterprises, supercomputing centers, hyperscale data centers, and public clouds. Read more…

Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

  • RSS
  • Twitter
  • Facebook
  • LinkedIn
  • Email the editor
  • About
  • Contributors
  • Contact
  • Sales
  • Newsletter
  • Books
  • Events
  • Privacy
  • Ts&Cs
  • Cookies
  • Do not sell my personal information

All Content Copyright The Next Platform