Originally Posted by ninelven
No offence, but it is fairly obvious you don't have a very clear technical understanding of the matter. Please answer the following questions:
Exactly, how much die space do the ROPs take on Fermi? What is the percentage of the total area of Fermi? How much die space do the ROPs take on Tahiti, and what percentage of the total area?
Exactly, how should the ROPs and texture units be "enhanced?"
Obviously, more pixel and texture fill would be great, but there is no evidence I can see that what Nvidia has now is inefficient.
Inefficient is a relative term depending on the what competition has and what it can do given the a certain die size, and keep in mind that tahiti is only a 365mm^ die at 28nm, and it's only using 32 ROPs like the previous generation Cayman used on the HD 6970, and both GPU's aren't that far apart in clock speeds(75 Mhz), yet tahiti beats it by a mile on fillrate and texturing in those charts, so it's obvious that AMD did a lot of improvements on the back end of tahiti and it wasn't just the shaders.
Fermi is a 530mm^ die at 40nm as we all know, and simply shrinking that core down to 28nm still yeilds a core clocking in at 371mm^ without adding anything new to it, making it roughly the same size as tahiti on the HD7970....Would a straight Fermi shrink to 28nm, and clocked at the same speed as the core on tahiti used in tthe HD7970, and using the same type of memory clocked at the same speed, using the same 384 bit memory, be enough to beat it in raw fillrate and texturing speed....Short answer is no simply by looking at the what those charts suggest...Especially the texturing one(ouch).
The single precision math of a tahiti core is 3.7 terraflops and dual precision is just about 1 terraflop even, while Fermi does 1.56 terraflops single precision and 650 gigaflops dual precision, so simply shrinking the core to 28nm and clocking it another 200 Mhz higher isn't enough to match the Tahiti never mind offering even more performance, wich is has too.
Enhancing in this case simply comes down to doing more work for every clock cycle, and it needs it in every major area that affects both gaming performance and professional application performance, so whatever Kepler ends up being, it has to be something new from the ground up basically, not just an enhanced Fermi...