wbmw: As far as I've heard, the Opteron uses the exact same backend as the Athlon. That includes the execution units.
It does have two more pipeline stages, though they are likely fairly early in the pipeline (maybe an extra decode / reorder stage or something). I don't recall seeing any hard material on this.
Presumably, the execution units would also have to have wider data paths and buffers.
But, yes, all in all not much of a difference there.
Of course, the funny thing about all this is the timeline. A feature like prefetching was reportedly originally developed for Hammer, then adapted to the Athlon core when Hammer was delayed and Intel was pushing hard. The improved branch prediction falls in that category too, I believe.
-fyo