Re: NGMA still has parts of P3 in it. The 411 decoder, now 4111 still can't decode more than one complex instruction per cycle. And if the next instruction is simple, can't even decode 1 complex instruction in a given cycle. K8 can do three in any given cycle. It makes it an all around performer on widely varying code.
Wrong again, Pete. AMD essentially has 3 simple decoders. Any "complex" instructions go through the vector path and the micro-code sequencer comes up with a uop equivalent, albeit at a cost to performance. You don't even know how this works, do you?