Followers | 14 |
Posts | 1998 |
Boards Moderated | 0 |
Alias Born | 06/15/2011 |
Thursday, December 14, 2023 3:09:51 PM
Karl Freund
Contributor
Founder and Principal Analyst, Cambrian-AI Research LLC
Dec 13, 2023,06:19pm EST
At the MI300 launch, AMD claimed it had significantly better performance than Nvidia. While the AMD chip does look good, and will probably run most AI just fine out of the box, the company did not use the fastest Nvidia software. The difference is enormous.
At a recent launch event, AMD talked about the inference performance of the H100 GPU compared to that of its MI300X chip. The results shared did not use optimized software, and the H100, if benchmarked properly, is 2x faster at a batch size of 1.
Nvidia has just released a blog that counters AMD's claim that its latest chip, the MI300X, is 40-60% faster in latency and throughput than Nvidia in inference processing for generative AI. Here is one of AMD’s slides at the MI300 launch event, which we covered here.
One of the incorrect slides AMD shared last week.
One of the incorrect slides AMD shared last week.AMD
Below is from Nvidia’s counterclaim. While this sort of tit-for-tat isn’t what anyone wants to hear, it is massively relevant this time; all the press and analyst reporting I’ve seen echo AMD’s claims, which are inaccurate and misleading.
The latest results, run on software available long before AMD prepared its presentation, doubled the performance claimed by AMD. And with batching for the 2.5-second latency AMD used, a standard in the industry, Nvidia beats the MI300 by an astonishing 14-fold.
The latest data from Nvidia leaves no doubt as to whose GPU is the fastest.
The latest data from Nvidia leaves no doubt as to [+]
NVIDIA
How Could This Happen?
It is simple. AMD did not use Nvidia’s software, which is optimized to improve performance on Nvidia hardware. “Though TensorRT-LLM is freely available on GitHub, recent comparisons by AMD used alternative software that does not yet support Hopper’s Transformer Engine and is missing these optimizations,” said the Nvidia blog post. Additionally, AMD did not take advantage of the TensorRT-LLM software that Nvidia released in September, doubling the inference performance on LLMs, nor the Triton inference engine. No TensorRT-LLM + no Transformer Engine + No Triton = non-optimal performance.
Since AMD has no equivalent software, it probably thought this was a better apples-t0-apples metric. These chips are expensive; I doubt anyone would not use the Nvidia software for production AI. It is free. “As LLM inference continues to grow in complexity, maximizing GPU performance on larger, increasingly sophisticated models using the latest inference software is critical to reducing cost and broadening adoption,” said Nvidia’s blog post.
What Does This Mean?
First, you can calm down if you are invested in Nvidia (stock or hardware). Nvidia remains the GPU leader. As Barron’s previously reported, “Investors Don’t Need to Worry.” And that was published before this latest news.
Second, if you are interested in the MI300X, we are not saying the new GPU is a bad AI platform. It appears to be the third fastest AI chip, behind Cerebras' massive WSE CS2 (for which there are no benchmarks) and the Nvidia H100. And that is probably good enough for those seeking a more available GPU that should be reasonably priced (whatever that means; AMD did not release pricing).
The AI hardware market is moving extremely fast, and the H100 will soon become old news. The H200 is coming more quickly than AMD probably hopes. We note that the MI300 FLOP specs are indeed better than Nvidia H100, and the MI300 also has more HBM memory. But it takes optimized software to make any AI chip sing and translate all those flops and bytes into customer value. The AMD ROCm software has made significant progress, but AMD still has much to do.
"AI is moving fast. NVIDIA’s CUDA ecosystem enables us to quickly and continuously optimize our stack. We look forward to continuing to improve AI performance with every update of our software,” said Nvidia.
Conclusions
While all this may seem like a tempest in a teapot to the uninitiated, all silicon vendors should work carefully to ensure accurate performance claims with actual data (not just normalized bar charts) and provide all the details necessary to reproduce those results. Handicapping a competitor's platform by not using the vendor's software isn't okay. That’s why MLCommons has published peer-reviewed MLPerf inference and training performance benchmarks every three months for several years.
Despite the kerfuffle, we stand by our earlier comments that AMD will sell every MI300 it can produce next year.
We asked AMD for a response and did not hear back.
When I asked Mark Papermaster, AMD CTO, if his company planned to run these benchmarks, he said they would publish MLPerf, but did not say when. We expect AMD will address the need for optimizations before they publish, and we can’t wait!
Recent NVDA News
- Form 8-K - Current report • Edgar (US Regulatory) • 11/07/2024 09:35:26 PM
- NVIDIA Names Ellen Ochoa to Board of Directors • GlobeNewswire Inc. • 11/07/2024 09:30:00 PM
- Trump Media Shares Surge 32% Pre-Market; Tesla Jumps 13%; Coinbase Rises with BTC All-Time High • IH Market News • 11/06/2024 11:09:49 AM
- Futures Pointing To Initial Rebound On Wall Street • IH Market News • 11/05/2024 02:01:10 PM
- U.S. Index Futures Rise as Election Looms; Oil and Gold Prices Steady • IH Market News • 11/05/2024 11:46:12 AM
- Tesla Sales Drop; Intel to Exit Dow Jones After 25 Years; VKTX and ATSG Surge Over 20% Pre-Market • IH Market News • 11/04/2024 11:20:51 AM
- NVIDIA and Sherwin-Williams Set to Join Dow Jones Industrial Average; Vistra to Join Dow Jones Utility Average • PR Newswire (US) • 11/01/2024 11:01:00 PM
- NVIDIA Sets Conference Call for Third-Quarter Financial Results • GlobeNewswire Inc. • 10/30/2024 09:00:00 PM
- NVIDIA Ethernet Networking Accelerates World’s Largest AI Supercomputer, Built by xAI • GlobeNewswire Inc. • 10/28/2024 03:00:00 PM
- Philips Sales Forecast Cut Sparks Stock Plunge; BP Resumes Libya Exploration; Alibaba Settles for $433 Million • IH Market News • 10/28/2024 10:09:33 AM
- Tesla Shares Rise 11% Pre-Market; IBM Falls 4% on Lower-Than-Expected Revenue; Apple Prepares New Launches • IH Market News • 10/24/2024 10:07:47 AM
- Purpose Investments Inc. Announces Risk Rating Change for NVIDIA (NVDA) Yield Shares Purpose ETF • GlobeNewswire Inc. • 10/22/2024 09:30:00 PM
- HSBC Begins Major Restructuring; SAP’s Cloud Revenue Grows 25%; Logitech Shares Drop 7.3% Pre-Market • IH Market News • 10/22/2024 10:04:11 AM
- iPhone 16 Sales in China Up 20%; Sobr Safe Jumps 108% Pre-Market; Netflix and Intuitive Surgical Beat Estimates • IH Market News • 10/18/2024 10:09:48 AM
- Lucid Drops 15% on New Share Offering; Phunware Continues to Rise in Pre-Market; Meta and Deere Cut Jobs • IH Market News • 10/17/2024 10:39:18 AM
- NVIDIA Contributes Blackwell Platform Design to Open Hardware Ecosystem, Accelerating AI Infrastructure Innovation • GlobeNewswire Inc. • 10/15/2024 04:30:00 PM
- Ericsson Shares Surge 7% on Strong Earnings, Nvidia Drops 2%, Google Invests in Nuclear Power for AI • IH Market News • 10/15/2024 09:46:33 AM
- TSMC To Exceed Profits, Plans European Factories; Boeing Cuts 17,000 Jobs; Berkshire Boosts Sirius XM Stake • IH Market News • 10/14/2024 10:11:42 AM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 10/11/2024 09:42:48 PM
- Tesla’s Robotaxi Day; GSK Settles 93% of Zantac Lawsuits; 10x Genomics Stock Drops on Revenue Forecast • IH Market News • 10/10/2024 10:45:48 AM
- Google Could Face Forced Spinoff; Rio Tinto Buys Arcadium for $6.7B; Boeing Suspends Union Talks • IH Market News • 10/09/2024 10:15:35 AM
- US Technology Leaders Tap NVIDIA AI Software to Transform World’s Industries • GlobeNewswire Inc. • 10/08/2024 03:21:21 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 10/07/2024 09:27:24 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 10/01/2024 08:19:18 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 09/26/2024 09:22:24 PM
SANUWAVE Announces Record Quarterly Revenues: Q3 FY2024 Financial Results • SNWV • Nov 8, 2024 7:07 AM
DBG Pays Off $1.3 Million in Convertible Notes, which Retires All of the Company's Convertible Notes • DBGI • Nov 7, 2024 2:16 PM
SMX and FinGo Enter Into Collaboration Mandate to Develop a Joint 'Physical to Digital' Platform Service • SMX • Nov 7, 2024 8:48 AM
Rainmaker Worldwide Inc. (OTC: RAKR) Announces Successful Implementation of 1.6 Million Liter Per Day Wastewater Treatment Project in Iraq • RAKR • Nov 7, 2024 8:30 AM
SBC Medical Group Holdings and MEDIROM Healthcare Technologies Announce Business Alliance • SBC • Nov 7, 2024 7:00 AM
VAYK Confirms Insider Buying at Open Market • VAYK • Nov 5, 2024 10:40 AM