John Reynolds
One theory I've read is that the FX architecture runs PS1.4 on its floating-point hardware (DX9 compliance requires PS2.0 to run at floating-point precision), while its legacy shader support (PS1.1) uses the integer functionality carried over from its predecessors (GF3/4). If so, the FX could very well be faster running PS1.1 on its integer units than PS1.4 on its floating-point units, even though PS1.1 requires more passes.

And, Roscoe, this is getting scary. I'm starting to agree with almost everything I see you post these days.
