InvestorsHub Logo
Followers 4
Posts 1914
Boards Moderated 0
Alias Born 09/18/2002

Re: economaniac post# 2571

Saturday, 04/19/2003 12:13:37 AM

Saturday, April 19, 2003 12:13:37 AM

Post# of 97868
The Intel fan-boys at Xbit-labs have posted a "hammer preview" of an early Athlon-64 (undeveloped single memory channel system) violating non-disclosure.

This isn't anything like the production systems, with final bios and dual memory channels that are due to be released next Tuesday. If AMD were as slimy as Intel, they'd have had some stooges release an "exclusive preview" of the new P4 3ghz on a board with a mis-set bios and PC100 SDRAM instead of DDR400:

When the 1.6ghz Athlon-64 with single memory channel is faster, it's noted with a brief sentence, when the 2.8ghz Pentium 4 with dual memory channels is faster, a glowing paragraph about the P4 follows.

Here's what they said:

We managed to get our hands on an engineering sample of the AMD Athlon 64 2800+ processor. We couldn’t help testing it, of course! In this article we are going to discuss the major features and benchmarks results for this processor.


Pages: 1 / 2 / 3 / 4 / 5 / 6 / 7 / 8 / 9 / 10 / 11
When Athlon processors came out in 1999, the competition in the processor market became much worse. The progressive architecture of this CPU allowed AMD to prove a worthy competitor to Intel and to give them quite a lot of causes for concern. The company’s processors didn’t yield to Intel’s solution in performance, and sometimes were even faster than those. However, it has become quite a hard task for AMD lately to retain this parity with Intel. During Athlon’s architecture life-time Intel managed to shift from Pentium III architecture to absolutely new Pentium 4 architecture, and then enhanced it significantly by increasing the L2 cache size and speeding up the system bus. Athlon also underwent certain enhancements, although, they never were that drastic. The most AMD did, included only system bus overclocking, L2 cache size increase, implementation of SSE support. As a result, Intel is now somewhat ahead of the competitor: Pentium 4 working frequencies grow very rapidly, while AMD has reached the top of Athlon architecture potential.


However, AMD has one very strong move reserved, which can change the situation in the market dramatically. The company prepares a new Athlon 64 processor to enter the desktop CPUs market. And even though they have been delaying this processor a number of times, so that the actual launch date has been postponed for almost a year since the initial schedule, the new architecture to be introduced in this CPU keeps exciting the public quite a lot. Take for instance the fact that Intel Company started paying quite a bit of attention in its confidential documents to the competitiveness of their Pentium 4 processor family against the upcoming AMD Athlon 64:

Closer Look: CPU
During the work on the new processor, AMD took the successful Athlon architecture as a basis. That is why the newly designed CPU has very much in common with the predecessor. Without going into details, I will list here the major differences between the ClawHammer core of the upcoming Athlon 64 processor and the contemporary Athlon XP.

Athlon 64 features better branch prediction algorithms and bigger TLB, which will make Athlon 64 run faster than Athlon XP working at the same clock frequency.
Integer pipeline of Athlon 64 is a little longer, which allows reaching higher clock frequencies. The integer pipeline of this processor has 12 stages altogether, while the FPU – 17 stages, against 10 and 17 stages respectively by Athlon XP.
Athlon 64 supports SSE2 instructions. This way this processor will support all existing extensions of x86 instructions set, which will allow it to work faster with applications optimized for Intel Pentium 4 processors.
Athlon 64 feature a built-in DDR SDRAM controller, so that it can communicate with the memory directly thus reducing the latencies on data requests.
The CPU and the South Bridge will be connected with the HyperTransport bus with up to 3.2GB/sec bandwidth in each direction.
Athlon 64 processor supports x86-64 technology, which allows it to work in special 64bit applications.

It’s also an Athlon, but a different one.

Well, please meet the CPU, which we managed to get:


AMD Athlon 64 2800+

As we see, our CPU is marked according to the rating, i.e. as 2800+. The production date in the next line of the marking indicates the beginning of this year. Athlon 64 processor with this rating is most likely to become the slowest model in the new processor family, when it is finally launched.

As we see, AMD is not going to give up the idea of marking CPUs with the rating instead of writing their actual core frequency. Well, let’s find out what hides behind this marking.


Many of you may be disappointed:
the actual core frequency of this processor is only 1.6GHz

The actual working frequency of this processor is 1.6GHz. When AMD faced some problems with increasing the ClawHammer core frequencies, they had to make the L2 cache bigger, to ensure adequate performance of the new CPUs. That is why our Athlon 64 2800+, working at pretty low actual core clock, features a 1MB L2 cache. The first mass Athlon 64 processors will have exactly the same L2 cache configuration. A bit later AMD will also launch a less expensive version with 256KB L2 cache.

By the way, you shouldn’t forget that Athlon 64, just like Athlon XP, features an exclusive L2 cache. In other words, taking into account the 128KB L1 cache (64KB for instructions and 64KB for data), inherited from Athlon XP, the overall cache memory of the new Athlon 64 makes 1128KB. The 0.13micron SOI technology used for Athlon 64 production allows placing so much cache memory on the same die with the actual CPU. As you may know, Intel will implement 1MB L2 cache only in Prescott processors, which will be manufactured with 0.09micron technology.

As for the structure of cache memory, the L2 cache of the Athlon 64 (just like the L2 cache of Athlon XP) features 16 associativity fields and 64-Byte long lines.


L2 cache of Athlon 64 is built like the L2 cache
of the current Athlon XP

Speaking about the support of different technologies, we should resort to the results obtained with the help of SiSoft Sandra 2003 benchmark:


3DNow!, x86-64 and SSE2 are all here

As we have expected, the CPU supports old instructions sets, like: MMX, 3DNow!, and SSE, as well as SSE2 and x86-64, which are completely new for AMD processors. Also note that the CPU features a built-in thermal diode. Hopefully, Athlon 64 is a hard nut to cook. Also AMD has finally listened to users’ complaints and hid the fragile CPU die under a copper lid with nickel-plated surface.

The CPU we tested was built with the B0 core revision. It is a much newer core than those used in the very first samples working at 800MHz and 1.2GHz core frequency. AMD redesigned this core to increase the clock frequencies and improved the embedded memory controller a little bit. As a result, our processor supported not only DDR266/DDR333 SDRAM, but also DDR400 SDRAM. However, unlike the server Opteron CPU, Athlon 64 will have a single-channel memory controller that is why the maximum memory bandwidth of the Athlon 64 based systems will make 3.2GB/sec.

As for HyperTransport bus, connecting Athlon 64 with the mainboard chipset, our CPU featured one 16bit bus working at 800MHz, which provided the bandwidth of 3.2GB/sec in each direction. However, we doubt that Athlon 64 will really need such a high-performance bus, because unlike other CPUs, it will not transfer any data to/from the system memory. Anyway, extra bandwidth has never been a drawback.

Testbed and Methods
First of all, I should point out that all our tests were run in 32bit Windows XP operation system and in 32bit applications. Unfortunately, 64bit operation systems and applications supporting x86-64 are not available yet. By the time Athlon 64 is out we have every reason to expect 64bit Windows XP version, as well as a few 64bit applications to be released. Among the first 64bit applications to come is the Unreal Tournament game, which upcoming announcement has already been disclosed by Epic. The use of 64bit applications may result into a significant boost of Athlon 64 performance due to involvement of additional registers and their expansion. However, there will hardly be too much software supporting x86-64 this year. Even AMD evaluates its prospects in terms of 64bit applications promotion as follows:


By the beginning of 2004 there will be
around 20% of 64bit software.

In other words, the results our Athlon 64 2800+ will demonstrate today, will not illustrate the performance these processors will show in 64bit operation systems and applications.

In our test session we will compare Athlon 64 2800+ with the following processors:

Athlon XP 2800+ based on Barton core (512KB L2 cache, 333MHz bus). This processor features the same rating as our Athlon 64, but it works at a much higher actual frequency of 2083MHz. Testing this processor today will allow us to estimate how true the rating used for Athlon 64 processors marking is.
Athlon XP based on Thoroughbred core working at 1.6GHz core clock and 400MHz bus. Testing this processor today will allow us to evaluate how efficient were the architectural improvements introduced in Athlon 64 and how greatly they tell on the processor performance.
Pentium 4 2.8C GHz processor. This CPU working with 800MHz bus and supporting Hyper-Threading technology will be one of the major competitors to Athlon 64 2800+ by the time the new AMD processor is out.
Pentium 4 2.53GHz processor working with 533MHz bus and featuring no Hyper-Threading support. This CPU will be one of the slowest Pentium 4 processors in the coming fall, so that we will be able to get a better idea of the Athlon 64 performance.
So, we will test the following systems:


Athlon 64 2800+
Athlon XP 2800+
Athlon XP 1.6GHz
Pentium 4 2.8C
Pentium 4 2.53GHz

Mainboard
Good one (VIA K8T400M)
ABIT NF7 (NVIDIA nForce2)
ASUS P4C800 (i875P)

Memory
512MB DDR400 SDRAM (Corsair XMS3200 v.1.1)

Graphics card
ATI RADEON 9700 PRO

HDD
Seagate Barracuda ATA IV, 80GB


The tests were run in Microsoft Windows XP SP1, the system’s BIOS was setup for maximum performance.

Well, before you go to the next page to get the whole bunch of benchmarks results, we would like to remind you that Athlon 64 is not yet ready, therefore, all the results brought up in this article are preliminary results. AMD may polish off its CPU a bit more, the mainboard guys may update their BIOS’s, and VIA may introduce some changes in the final chipset version. That is why the performance of mass platforms built with Athlon 64 processors after their official launch may grow higher.

Performance
First of all, we decided to take a closer look at the performance of the built-in memory controller. Especially since we haven’t yet had a chance to play with the DDR SDRAM controller built into a CPU, even though some similar technology is used by SoC solutions developers.

First let’s check the numbers obtained in Cachemem benchmark, which we always use when there is a new memory controller tested:

Athlon 64 2800+
Athlon XP 1.6GHz
Pentium 4 2.8C

Memory read speed, MB/s
2610.2
1747.8
3193.5

Memory write speed, MB/s
1099
1156.9
1320.5

Memory copy speed, MB/s
1541.7
1244.8
2678.6

Latency
96
165
260


This table sums up the data obtained for three different memory controllers: the one integrated into Athlon 64, the one from the nForce2 chipset and the one from i875P chips. All controllers were used with DDR400 SDRAM, nForce2 and i875P worked in dual-channel mode.

As we see, the single-channel Athlon 64 controller is much faster than the dual-channel controller of the nForce2 core logic, which cannot show its best because of the limited processor bus bandwidth. At the same time, Athlon 64 had hard time trying to compete with the bandwidth of i875P dual-channel controller. But as soon as it comes to latency, Athlon 64 becomes an indisputable leader having left all the rivals far behind. Thanks to this processor’s ability to work with the memory directly, the latency during work with the system memory turns out very low.

Performance
First of all, we decided to take a closer look at the performance of the built-in memory controller. Especially since we haven’t yet had a chance to play with the DDR SDRAM controller built into a CPU, even though some similar technology is used by SoC solutions developers.

First let’s check the numbers obtained in Cachemem benchmark, which we always use when there is a new memory controller tested:

Athlon 64 2800+
Athlon XP 1.6GHz
Pentium 4 2.8C

Memory read speed, MB/s
2610.2
1747.8
3193.5

Memory write speed, MB/s
1099
1156.9
1320.5

Memory copy speed, MB/s
1541.7
1244.8
2678.6

Latency
96
165
260


This table sums up the data obtained for three different memory controllers: the one integrated into Athlon 64, the one from the nForce2 chipset and the one from i875P chips. All controllers were used with DDR400 SDRAM, nForce2 and i875P worked in dual-channel mode.

As we see, the single-channel Athlon 64 controller is much faster than the dual-channel controller of the nForce2 core logic, which cannot show its best because of the limited processor bus bandwidth. At the same time, Athlon 64 had hard time trying to compete with the bandwidth of i875P dual-channel controller. But as soon as it comes to latency, Athlon 64 becomes an indisputable leader having left all the rivals far behind. Thanks to this processor’s ability to work with the memory directly, the latency during work with the system memory turns out very low.
There is something to think of here. Take, for instance, ALU performance in Athlon 64, which has got 8% higher than that of the Athlon XP ALU, according to this test. This improvement was possible due to improved branch prediction and TLB. However, this is far not enough for successful competition with Intel Pentium 4, where ALU works at doubled frequency. As for the FPU performance, this unit remained unchanged that is why the performance is also the same as by Athlon XP. And this is more than enough to defeat Intel Pentium 4 2.8C, despite the support of Hyper-Threading technology. Well, the FPU unit of Athlon processors was made very powerful from the very beginning. As far as SSE2 unit of Athlon 64 is concerned, it turned out a disappointment. Our Athlon 64 fell quite far even behind Pentium 4 2.53GHz.

Will Athlon 64 2800+ be able to perform at least as fast as Athlon XP 2800+?

Well, it looks as if it could. Although a lot will depend on each particular task. The low clock frequency of Athlon 64 2800+ (1.6GHz) pushes it behind Athlon XP 2800+ working at 2.083GHz in terms of computational power. However, as soon as it comes to operations with the memory, Athlon 64 manages to get ahead.

Now let’s check the situation in real applications.

The complex Business Winstone 2002 test set illustrating the average performance in typical office applications, showed that the good old Athlon XP 2800+ was about 8.5% faster than the new Athlon 64 2800+. The only reason for this situation is the 30% higher working frequency of the Athlon XP 2800+ processor. Unfortunately, neither the huge cache, nor the fast memory subsystem of Athlon 64 help this processor to make up for lower core clock. At the same time, Athlon 64 2800+ outperforms Athlon XP 1.6GHz by the same 8%.

To our great disappointment, we didn’t manage to run Multimedia Content Creation Winstone 2003 test on the Athlon 64 based system. During this test the same error kept popping up, though it had luckily nothing to do with the problems of the tested platform.

AMD is not very enthusiastic about SYSmark2002 benchmark, believing that the testing algorithms used in it favor the competitor’s products. Maybe it is true. What is worth your attention here, however, it’s the continuing lag of the Athlon 64 behind Athlon XP of the same rating. But here it is not such a big lag, actually, as the one we saw in Business Winstone 2002. Our hero is only 2-3% behind Athlon XP 2800+.

During mp3 encoding with lame codec the new Athlon 64 suffers a complete failure. Sound files encoding is a task that requires high “pure” CPU performance, which is a problem for Athlon 64 because of its low core clock frequency. So, as you may see on the diagram, the result shown by Athlon 64 2800+ is close to that of Athlon XP 1.6GHz.

And during data compression with WinRAR utility, Athlon 64 shows its best. Due to a large cache, which can save the biggest part of the dictionary, and also low memory subsystem latency, Athlon 64 2800+ easily outperforms not only Athlon XP 2800+, but even Intel Pentium 4 2.8C with 800MHz bus and Hyper-Threading technology.

Video encoding into mpeg-4 format again changes the whole picture. Hyper-Threading technology allows Intel Pentium 4 2.8C to get ahead of the racers. While Athlon 64 2800+ is again behind Athlon XP 2800+, though only a little bit this time.

In 3DMark03 Athlon 64 2800+ is finally ahead of Athlon XP 2800+. However, Intel Pentium 4 2.8C is still the definite leader.

CPU Score is a parameter obtained during the vertex shaders emulation by the system CPU. That is why fast ALU and high memory subsystem bandwidth matter a lot in this test. As a result, Athlon 64 2800+ is again not only behind Pentium 4 2.8C, but also behind Athlon XP 2800+.

In 3DMark2001 SE Athlon 64 2800+ managed to almost catch up with Pentium 4 2.8C and to outperform Athlon XP 2800+ quite significantly.

The similar picture can be seen in Return to Castle Wolfenstein game built on Quake3 gaming engine.

Unreal Tournament 2003 has always run fast with Athlon XP processors. However with Athlon 64, this game is even faster. Athlon 64 2800+ is the indisputable leader here.

Athlon XP used to be fairly considered the best CPU for scientific calculations due to its powerful three-pipeline FPU. Athlon 64 retained this FPU unit also. While the clock frequency dropped. As a result, the newcomer cannot work as fast as its predecessor.

Rendering in 3ds max5 is a typical computational task. The architectural improvements introduced in Athlon 64 can’t make up for its low working frequency, just like in scientific tests. As a result, Athlon XP 2800+ shows 12% higher rendering speed than Athlon 64 2800+. Nevertheless, the freshly introduced enhancements in Athlon 64 do pay back in this benchmark, since the clock frequency difference makes over 30%, while the results difference is much lower.

In Lightwave 7.5 the rendering speed depends a lot on the scene type. In some cases Athlon 64 can show its best due to larger L2 cache, SSE2 instructions support and low latency during memory requests. And in other cases it performance gets close to Athlon XP 1.6GHz.

Cinebench 2003 [a fresh ringer from Intel's benchmark fab?] is about too many calculations and Athlon 64 can’t cope with them successfully. By the way, as soon as Intel introduced Hyper-Threading technology support in its Pentium 4 processors, their performance in rendering applications grew up immensely

Conclusion
The major conclusion, which we can draw as a result of this test session, sounds as follows. Even though Athlon 64 processors have internal architecture very similar to that of Athlon XP processors, they still differ from their predecessors quite significantly from the practical point of view. We can’t give you a definite answer to the question, if Athlon 64 has become any faster than Athlon XP. In fact, this is just a different processor.

Moreover, there is also not much we could say about the performance of Athlon 64 in 64bit applications or at least in 64bit operation systems and 32bit applications. Supposedly, x86-64 will ensure a significant performance improvement, but it is also quite possible that x86-64 will not receive a warm welcome from the software developers. AMD has already tried to promote its own instructions set a while ago, and this experience could hardly be regarded as a success, to tell the truth. 3DNow! instructions set failed to become widely spread even though it proved to be very convenient to work with. So, we can only wait here for the first signs for or against these suppositions.

Speaking about the performance of Athlon 64 in traditional 32bit applications we can say that this new CPU boasts a few very remarkable and strong features: large L2 cache, high-performance memory subsystem and SSE2 instructions support. On the other scale we see relatively low core frequency. As a result, we see either a performance boost or a performance drop depending on each particular application and its critical parameters.

For example, Athlon 64 is not very successful in traditional calculating tasks, such as scientific calculations or 3D rendering. But as soon as we get to games or info compression, it appears beyond any competition. In general, if we compare the performance of Athlon 64 2800+ with that of Athlon XP 2800+, we will have to admit that the latter appeared slower than our today’s hero in quite a bit of benchmarks.

And in conclusion, I would like to remind you once again that this article is none other but the first look at AMD Athlon 64 performance. By the time it is launched officially this fall, a lot of things may change. For instance, AMD may raise the core clock frequency of these processors














Volume:
Day Range:
Bid:
Ask:
Last Trade Time:
Total Trades:
  • 1D
  • 1M
  • 3M
  • 6M
  • 1Y
  • 5Y
Recent AMD News