InvestorsHub Logo
icon url

pgerassi

05/11/05 12:51 AM

#55890 RE: wbmw #55889

Dear Wbmw:

I suggest you take a look at how data is moved between CPUs on Intel's FSB. One CPU can not ring up another CPU and move data. It must go at least to the NB as there are no CPU to CPU bus states, only idle, CPU to NB and NB to CPU. There is an arbitration done with the NB deciding who "wins" access to the FSB. And CPUs can snoop other CPU's transactions with the NB to keep the caches coherent. But that is not really communication between the CPUs as the data is only that which is stored on each CPUs cache.

As to Opterons not being able to "string" the CPUs to make a computer system, they can do just that and only need other chips to translate HTT to other types of I/O. In fact if you bury on a memory DIMM, a message passing device, you do not need anything on the MB, but power, reset, CPU sockets, DIMM sockets and traces connecting them. The BIOS can be placed on a DIMM as well. Granted that computer system will be quite limited, but it can work. Try that with an Intel MB with no NB.

And many A64 systems could be made to work with just one SB. It can have a GPU, NICs, IDEs, SATAs, USBs, sound, firewire, PS/2 and no memory controller and still be a complete computer even by your standards. And the very same SB can be used on a 8 socket Opteron system with no changes.

Pete
icon url

fastpathguru

05/11/05 1:38 AM

#55892 RE: wbmw #55889

By the way, the strength in Opteron doesn't come from talking directly to other Opterons, it comes from offering the best trade off between system bandwidth and average system latency to main memory.

Clearly, the fact that every Opteron you add to a system adds not just computational logic but another share of memory capacity, memory bandwidth, and system interconnect bandwidth (at the cost of slightly higher average memory latency) is a significant factor. It's not just that there are multiple pathways, its that the resources themselves scale.

Unfortunately, all the Xeons on the same bus have to share the same path to main memory...

But even presenting individual FSBs to each Xeon would hardly add anything... The NB's bandwidth to memory itself would have to double as well to keep pace with the Opteron. But that extra memory bandwidth would be useless in a 1P system. Xeon really needs a custom NB for each value of N in N-way systems.

It's the integration of 1 NB's worth of resources into each Opteron that lets it scale so efficiently.

fpg