Friday, November 21, 2008 10:44:29 PM
Elmer:
Your backing of SPECxxx_rate_base2006 is not how most compare systems for HPC work. Because of the way this benchmark has evolved over time, it is fairly meaningless for most HPC users who will test each platform in their own environment and see which is both fast and, usually, cheap. For most others, even "base" is far from what they find in the real world when they use their production compiler (gcc or MS C (and variants)) and how the resulting software performs (this was the original goal of SPEC in the first place before being co-opted by OEM marketing departments). So "base" overstates what a normal developer would get for the typical environment and "peak" is far from the most an advanced savvy developer will get from a system.
The more typical server buyer is like what Dan3 does on SI, he gets each marketed system to test using his environment and their applications and see what their performance and how much would the system set them back for money and power used. Then after testing samples from each supplier they look at (or who wanted the sale), they choose the one that gets them the most performance for their budget. Frankly almost everyone says that is the best way to do it. SPEC CPU2006 might help get past that initial cut of who to bring in to test, just like TPC or SAP do, but its part of the beginning and nowhere near the end.
For the rest of us, its a debating point. I choose that because of Intel's "cheating" of flag handling, base is more meaningless than ever before (sometimes it got to the point that base and result scores matched exactly, which shouldn't happen in the real world). The standard result is closer to what savvy users would get, but given the tendency of SPEC not allowing the use of typical numeric libraries like BLAS is far from what starting HPC developers would get much less the savvy ones. Thus the result score gets closer, but isn't really indicative of real world results. Thus, that is the one I, reluctantly, use. And if you really check back, I have always used "peak" instead of "base" to compare CPUs.
The point is that Nehalem isn't out yet and the scores it gets have to be taken with a large dose of salt. Just look at SPECint_2006. At 24 cores, the top Xeon gets 25.5, at 16, its 25.0 (from 2.67GHz Dunnington to 2.93GHz Tigerton), at 12 there isn't one, at 8, its 30.3 (3.33GHz X5470 Harpertown), at 4, its 30.2 (3.5GHz Wolfdale, 3.2GHz i7 965 gets 33.6), at 2, its 26.3 (3.13GHz E3120 Wolfdale), at 1, its 17.4 (3GHz Woodcrest 5160). Generally the speed drops as the core count goes up (and the clock as well). Thus extrapolating speed from 1P to 4P is not 4x because the clock goes slower, some memory contention happens and cache coherency checks slow it down more. Else there would be no slow down as the core count goes up.
Given the above, Nehalem will not get 2x Shanghai when they actually meet. Not in typical server type loads and not in HPC work. Actually savvy HPC developers would likely get a bigger boost going to a GPGPU than by going to a higher speed CPU. A $500 GPGPU crunches numbers (480 DP Gflop/s (2.4 SP Tflop/s) far faster than a $500 CPU can (12 DP Gflop/s).
Pete
Your backing of SPECxxx_rate_base2006 is not how most compare systems for HPC work. Because of the way this benchmark has evolved over time, it is fairly meaningless for most HPC users who will test each platform in their own environment and see which is both fast and, usually, cheap. For most others, even "base" is far from what they find in the real world when they use their production compiler (gcc or MS C (and variants)) and how the resulting software performs (this was the original goal of SPEC in the first place before being co-opted by OEM marketing departments). So "base" overstates what a normal developer would get for the typical environment and "peak" is far from the most an advanced savvy developer will get from a system.
The more typical server buyer is like what Dan3 does on SI, he gets each marketed system to test using his environment and their applications and see what their performance and how much would the system set them back for money and power used. Then after testing samples from each supplier they look at (or who wanted the sale), they choose the one that gets them the most performance for their budget. Frankly almost everyone says that is the best way to do it. SPEC CPU2006 might help get past that initial cut of who to bring in to test, just like TPC or SAP do, but its part of the beginning and nowhere near the end.
For the rest of us, its a debating point. I choose that because of Intel's "cheating" of flag handling, base is more meaningless than ever before (sometimes it got to the point that base and result scores matched exactly, which shouldn't happen in the real world). The standard result is closer to what savvy users would get, but given the tendency of SPEC not allowing the use of typical numeric libraries like BLAS is far from what starting HPC developers would get much less the savvy ones. Thus the result score gets closer, but isn't really indicative of real world results. Thus, that is the one I, reluctantly, use. And if you really check back, I have always used "peak" instead of "base" to compare CPUs.
The point is that Nehalem isn't out yet and the scores it gets have to be taken with a large dose of salt. Just look at SPECint_2006. At 24 cores, the top Xeon gets 25.5, at 16, its 25.0 (from 2.67GHz Dunnington to 2.93GHz Tigerton), at 12 there isn't one, at 8, its 30.3 (3.33GHz X5470 Harpertown), at 4, its 30.2 (3.5GHz Wolfdale, 3.2GHz i7 965 gets 33.6), at 2, its 26.3 (3.13GHz E3120 Wolfdale), at 1, its 17.4 (3GHz Woodcrest 5160). Generally the speed drops as the core count goes up (and the clock as well). Thus extrapolating speed from 1P to 4P is not 4x because the clock goes slower, some memory contention happens and cache coherency checks slow it down more. Else there would be no slow down as the core count goes up.
Given the above, Nehalem will not get 2x Shanghai when they actually meet. Not in typical server type loads and not in HPC work. Actually savvy HPC developers would likely get a bigger boost going to a GPGPU than by going to a higher speed CPU. A $500 GPGPU crunches numbers (480 DP Gflop/s (2.4 SP Tflop/s) far faster than a $500 CPU can (12 DP Gflop/s).
Pete
Recent AMD News
- Semiconductor shares climb in premarket trade as investor confidence improves • IH Market News • 05/26/2026 12:55:28 PM
- Citi Forecasts Massive Expansion in Server CPU Market Through 2030 • UK Market News • 05/23/2026 02:02:13 PM
- Citigroup Sees Server CPU Market Reaching $132 Billion by 2030 as Intel Retains Leadership • IH Market News • 05/23/2026 02:01:14 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/22/2026 09:43:20 PM
- AMD begins ramp-up of next-generation EPYC chips using TSMC’s 2nm technology (AMD) • IH Market News • 05/21/2026 10:44:03 AM
- AMD Announces More Than $10 Billion in Taiwan Ecosystem Investments to Accelerate AI Infrastructure • GlobeNewswire Inc. • 05/21/2026 05:35:00 AM
- AMD Announces Production Ramp of Next-Generation AMD EPYC Processor “Venice” on TSMC 2nm Process Technology • GlobeNewswire Inc. • 05/21/2026 05:35:00 AM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 05/20/2026 08:53:40 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/19/2026 08:13:08 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 09:05:45 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:57:43 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:55:26 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:54:01 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:52:39 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:50:17 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/15/2026 08:49:14 PM
- Form S-8 - Securities to be offered to employees in employee benefit plans • Edgar (US Regulatory) • 05/15/2026 08:39:40 PM
- Form 8-K - Current report • Edgar (US Regulatory) • 05/15/2026 08:12:53 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 05/15/2026 08:07:05 PM
- AMD and ARM Extend Server Market Gains While Intel Loses Share (AMD) • IH Market News • 05/14/2026 10:30:40 AM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 05/13/2026 08:42:35 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/13/2026 08:24:34 PM
- Form 13F-HR - Quarterly report filed by institutional managers, Holdings • Edgar (US Regulatory) • 05/12/2026 09:00:25 PM
- Form 4 - Statement of changes in beneficial ownership of securities • Edgar (US Regulatory) • 05/12/2026 08:03:56 PM
- Form 144 - Report of proposed sale of securities • Edgar (US Regulatory) • 05/08/2026 08:05:43 PM
