dacaw

10/18/04 11:30 AM

#46004 RE: chipguy #46003

MMX & FP in Video code.

There's a lot of data shuffling that goes on, plus clamping of the outputs, as well as initial operations that can be done in integer.

Basically the loop sequence is:

1) Organize the integer*16 data (MMX)
2) Do operations in integer that lose no accuracy (MMX), e.g. 16*16 -> 32 or simple adds
3) Convert to FP
4) Do the multiplicative parts (lots and lots of FP calculation)
5) Convert back to integer
6) Clamp the output (MMX)
7) Organize the output (MMX)
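
To make the sequence concrete, here is a rough scalar C sketch of one pass through the loop. It is not the actual video code: the names process_samples and clamp_u8, the single gain multiply standing in for the "lots and lots of FP calculation", and the 8-bit output range are all assumptions for illustration. The MMX version would do the integer steps several samples at a time.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper for step 6: clamp a 32-bit value to 0..255.
   The 8-bit output range is an assumption, not from the original post. */
uint8_t clamp_u8(int32_t v)
{
    if (v < 0)   return 0;
    if (v > 255) return 255;
    return (uint8_t)v;
}

/* Scalar sketch of the seven steps. 'gain' is a made-up stand-in for the
   real FP work. Assumes the scaled value still fits in an int32_t. */
void process_samples(const int16_t *a, const int16_t *b, float gain,
                     uint8_t *out, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        /* 1) Organize the 16-bit input data. */
        int32_t x = a[i];
        int32_t y = b[i];

        /* 2) Exact integer work: a 16*16 -> 32 multiply cannot overflow. */
        int32_t prod = x * y;

        /* 3) Convert to FP. */
        float f = (float)prod;

        /* 4) The multiplicative part (one multiply here, many in practice). */
        f *= gain;

        /* 5) Convert back to integer, rounding half away from zero. */
        int32_t r = (int32_t)(f + (f >= 0.0f ? 0.5f : -0.5f));

        /* 6) Clamp and 7) organize (store) the output. */
        out[i] = clamp_u8(r);
    }
}
```

Calling process_samples with a = {100, -50, 300, 16}, b = {2, 4, 2, 16} and gain = 0.5f exercises both ends of the clamp: the negative product is pinned to 0 and the oversized one to 255.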

I will do a version that replaces the MMX with SSE2 when I get time. I haven't looked at the code in some years.