Uttar
01-28-03, 01:19 PM
Hey,
I'd like to comment on the GFFX. But then, why does this deserve a new thread? Well, that's quite simply because I'm going to give a completely different reasoning, and conclusion, than everyone else :) And no, I'm not going to say the GFFX is the greatest thing ever. It isn't. So read on...
First of all, let me say Anand did excellent IQ tests. IMO, he just didn't interpret the performance / IQ correctly.
Let me begin by talking about Antialiasing. Let's see what the respective performance hits are, shall we?
http://www.anandtech.com/video/showdoc.html?i=1779&p=13
GFFX 4x AA: 24% performance hit
Radeon 9700 Pro 4x AA: 25% performance hit
As you can see, the GFFX AA is *cheaper* than the Radeon 9700 Pro. Why? After all, it got 20% less bandwidth...
The answer, sadly, isn't so pretty. The GFFX 4x AA patterns are ordered, giving lower IQ and more repeating colors. Notice the "more repeating color" part? That means the SAME compression algorithm would work better on the GFFX 4x than on the Radeon 9700 Pro 4x!
However, in most cases that shouldn't make much of a difference. So the GFFX performance hit with the Radeon 9700 Pro pattern might be more like 27% or something.
Now, why is 6xS so darn slow? The answer lies the the SSAA part of it. Color Compression is a LOT less efficient with that, because not all subpixels are identical. The Radeon 9700 Pro 6x AA got all subpixels identical, so it's really normal the performance is a lot better.
4x AA scores for the GFFX, in fact, are amazingly good seeing how the GFFX got 20% less memory bandwidth than the Radeon 9700 Pro. Very few reviews got 4x AA only scores, but two do. Hexus & Tom. Both sites conclusion is ridiculous and the AF comparaison is not fair at all ( ATI got a signifiant disadvantage ) , but I doubt the benchmarks for AA are tricked.
http://www.hexus.co.uk/review.php?review=497&page=8
http://www.tomshardware.com/graphic/20030127/geforce_fx-19.html#4x_fsaa
http://www.hexus.co.uk/review.php?review=497&page=6
As you can see, in the worst case the GFFX got a 11% lower score than the Radeon 9700 Pro, with 20% slower memory. On average, they're actually *on par* with only 4x AA!
Sounds like ATI is getting a little bit more waste than nVidia with their 256-bit memory bus.
Conclusion for AA: The GFFX got two big disadvantages. Worse 4x AA patterns, and no 6x true MSAA mode. The "no true 6x MSAA mode" is only a problem if you run at lower resolutions, because seeing the difference between 1600x1200 4x AA & 6x AA is just impossible IMO.
As for the worse 4x AA patterns. That's what 4xS is supposed to fix. But the problem is that 4xS also got SSAA in it, so performance will be a lot worse. That, thus, is a huge problem.
I already said the 2x AA quality is a bug in another thread, and that it should be fixed easily with drivers. So I won't count that as a disadvantage ( it'll be fixed when we'll get to buy the card )
Maybe 4x AA is really rotated, who knows. nVidia seems to claim that AA&AF things are happening after the frame buffer, so maybe that isn't captured correctly either. But until I get a proof of it, I refuse to trust that.
Aniso now... Let's look at the numbers.
http://www.anandtech.com/video/showdoc.html?i=1779&p=10
Now, we can see nVidia's biggest problem. The HUGE performance hit when using Balanced, and the bad IQ when using aggressive. The solution, for nVidia, would be to make an intermediary setting between Aggressive and Balanced, and I really hope they'll do so. However, I fear that's unlikely :(
Right now, there's really two things thatc can be done to compare Aniso. In case you want the very best quality, you compare nVidia's 8x Balanced with ATI's 8x Performance. There, the GFFX loses very badly.
The other solution is to compare nVidia's 8x Aggressive with ATI's 4x Performance: this gives significantly lower IQ, but the IQ is the same on both cards and the performance hit is also nearly identical.
Conclusion on Aniso: Right now, nVidia got a clear disadvantage here. It is the GFFX biggest problem. I urge nVidia to implement an intermediary mode between Aggressive and Balanced ( preferably via drivers, but if it's not possible that's essantial for the NV35 ) - otherwise, they've badly lost that battle.
Now, there are several other things people have complained about, or said which were quite anormal. What are those, you ask?
First of all, some of 3DMark's 2001 SE synthetic scores are *not* what most people would have supposed based on paper specs. So, what happened?
http://www.tomshardware.com/graphic/20030127/geforce_fx-27.html
The Radeon 9700 Pro actually scores a *higher* Single Texturing score! Why? Bandwidth.
With my GF4 Ti4200 at 250/250, my single-texturing score is of 861.8MP/s
At 250/200, my score drops to 726.6MP/s due to bandwidth limitations.
So, that one is explained and drivers won't be able to do much about it: 3DMark 2001 SE fillrate tests are ridiculous and false.
Now, there's also the Advanced Pixel Shader test which uses PS1.4. where ATI got a nice lead. I've got no data to back this reasoning, but anyway...
It was revealed that DX9 was not working in those drivers. That mean PS1.4. support is probably pretty early in them, and not much work was done on it yet. So it sounds reasonable to say that nVidia has done few optimizations, and it's the area they've focused the less in.
Thus, improvements will mostly be seen in such areas.
As for a slower VS speed than the Radeon 9700 Pro when using complex shading programs. It's hard to say, really. Those programs aren't *so* complex, and bandwidth might be an issue. Also, the GFFX shading architecture is really optimized for very long programs, so I really think those programs might not put it in its best light.
Of course, huge programs won't be seen in games for a while. So those synthetic benchies most certainly show us that the GFFX VS is a little bit inferior in games which are going to release soon.
Finally... The dustbuster, aka Flow FX. Well, the noise really seems bad. But I guess that was known since they announced the whole system. However, I'd like to see lower 2D clock rates. 300/300 is completely useless in 2D! 250/250 is really a maximum, and it would generate less noise. Okay, so you can change that yourself with coolbits. But not everyone is going to do that, and I think it would be a good change.
Also, it would be nice if Flow FX could automatically put the card to something more like 400/400 if it sees FPS is more than, for example, 100. That obviously can't be done via drivers, but it's already on my NV35 wishlist.
Conclusion: On the feature side of things, nVidia is a clear winner. The GFFX is future oriented, and got a lot of cool things for developers. It's unlikely, however, that they will be used in the very short life of the card.
AA Performance is on par with what I expected. AA IQ, however, isn't: I had hoped for a pure MSAA mode with a rotated grid. 4xS performance hit is probably just gonna be too big.
AF Performance is good too, but AF IQ is not. That means AF is really a lot worse than what I had expected.
nVidia got a fast algorithm with the 8x Aggressive mode, but its IQ is on par with ATI's 4x Performance mode and that's the best the adaptive modes can do! Thus, many reviewers choose to use "balanced" and that's why the GFFX is losing so much in all those benchmarks.
The dustbuster is a problem, but it's not huge.
For the GFFX to satisfy what I hoped for, the following should be done ( in increasing order of importance ) :
- Decrease default Flow FX 2D clocks to 250/250 instead of 300/300
- Fix all reported bugs. There seems to be a fair bit of them.
- 5% to 10% performance increase in no AA/AF performance.
- 4x AA mode with pure MSAA and rotated grid sampling. 4xS performance is probably going to be too low.
- AF mode which is a compromise between Aggressive and Balanced. Aggressive is too aggressive, and Balanced performance drop is too huge. We need something between both!
- Send me a NV35 prototype before every other tech site :D j/k
Right now, I'd give the GFFX a mere 65%. If those things were done, I would easily give it 80%, or even 85% if the performance increase they can do via drivers is more like 25%. That is, if I was a reviewer :p
No matter what, I'll probably buy a GFFX if at least the three first things are done. But I might decide not to take the Ultra, not sure if it's really worth the extra money.
Anyway, that's my opinion. As you can see, it's kinda different from most other people's opinion.
I hope you liked reading this. Yes, it's long, and maybe I could have made it shorter. But I really felt like saying all of this, I've been thinking a lot about it. Anyway, thanks for reading!
Uttar
I'd like to comment on the GFFX. But then, why does this deserve a new thread? Well, that's quite simply because I'm going to give a completely different reasoning, and conclusion, than everyone else :) And no, I'm not going to say the GFFX is the greatest thing ever. It isn't. So read on...
First of all, let me say Anand did excellent IQ tests. IMO, he just didn't interpret the performance / IQ correctly.
Let me begin by talking about Antialiasing. Let's see what the respective performance hits are, shall we?
http://www.anandtech.com/video/showdoc.html?i=1779&p=13
GFFX 4x AA: 24% performance hit
Radeon 9700 Pro 4x AA: 25% performance hit
As you can see, the GFFX AA is *cheaper* than the Radeon 9700 Pro. Why? After all, it got 20% less bandwidth...
The answer, sadly, isn't so pretty. The GFFX 4x AA patterns are ordered, giving lower IQ and more repeating colors. Notice the "more repeating color" part? That means the SAME compression algorithm would work better on the GFFX 4x than on the Radeon 9700 Pro 4x!
However, in most cases that shouldn't make much of a difference. So the GFFX performance hit with the Radeon 9700 Pro pattern might be more like 27% or something.
Now, why is 6xS so darn slow? The answer lies the the SSAA part of it. Color Compression is a LOT less efficient with that, because not all subpixels are identical. The Radeon 9700 Pro 6x AA got all subpixels identical, so it's really normal the performance is a lot better.
4x AA scores for the GFFX, in fact, are amazingly good seeing how the GFFX got 20% less memory bandwidth than the Radeon 9700 Pro. Very few reviews got 4x AA only scores, but two do. Hexus & Tom. Both sites conclusion is ridiculous and the AF comparaison is not fair at all ( ATI got a signifiant disadvantage ) , but I doubt the benchmarks for AA are tricked.
http://www.hexus.co.uk/review.php?review=497&page=8
http://www.tomshardware.com/graphic/20030127/geforce_fx-19.html#4x_fsaa
http://www.hexus.co.uk/review.php?review=497&page=6
As you can see, in the worst case the GFFX got a 11% lower score than the Radeon 9700 Pro, with 20% slower memory. On average, they're actually *on par* with only 4x AA!
Sounds like ATI is getting a little bit more waste than nVidia with their 256-bit memory bus.
Conclusion for AA: The GFFX got two big disadvantages. Worse 4x AA patterns, and no 6x true MSAA mode. The "no true 6x MSAA mode" is only a problem if you run at lower resolutions, because seeing the difference between 1600x1200 4x AA & 6x AA is just impossible IMO.
As for the worse 4x AA patterns. That's what 4xS is supposed to fix. But the problem is that 4xS also got SSAA in it, so performance will be a lot worse. That, thus, is a huge problem.
I already said the 2x AA quality is a bug in another thread, and that it should be fixed easily with drivers. So I won't count that as a disadvantage ( it'll be fixed when we'll get to buy the card )
Maybe 4x AA is really rotated, who knows. nVidia seems to claim that AA&AF things are happening after the frame buffer, so maybe that isn't captured correctly either. But until I get a proof of it, I refuse to trust that.
Aniso now... Let's look at the numbers.
http://www.anandtech.com/video/showdoc.html?i=1779&p=10
Now, we can see nVidia's biggest problem. The HUGE performance hit when using Balanced, and the bad IQ when using aggressive. The solution, for nVidia, would be to make an intermediary setting between Aggressive and Balanced, and I really hope they'll do so. However, I fear that's unlikely :(
Right now, there's really two things thatc can be done to compare Aniso. In case you want the very best quality, you compare nVidia's 8x Balanced with ATI's 8x Performance. There, the GFFX loses very badly.
The other solution is to compare nVidia's 8x Aggressive with ATI's 4x Performance: this gives significantly lower IQ, but the IQ is the same on both cards and the performance hit is also nearly identical.
Conclusion on Aniso: Right now, nVidia got a clear disadvantage here. It is the GFFX biggest problem. I urge nVidia to implement an intermediary mode between Aggressive and Balanced ( preferably via drivers, but if it's not possible that's essantial for the NV35 ) - otherwise, they've badly lost that battle.
Now, there are several other things people have complained about, or said which were quite anormal. What are those, you ask?
First of all, some of 3DMark's 2001 SE synthetic scores are *not* what most people would have supposed based on paper specs. So, what happened?
http://www.tomshardware.com/graphic/20030127/geforce_fx-27.html
The Radeon 9700 Pro actually scores a *higher* Single Texturing score! Why? Bandwidth.
With my GF4 Ti4200 at 250/250, my single-texturing score is of 861.8MP/s
At 250/200, my score drops to 726.6MP/s due to bandwidth limitations.
So, that one is explained and drivers won't be able to do much about it: 3DMark 2001 SE fillrate tests are ridiculous and false.
Now, there's also the Advanced Pixel Shader test which uses PS1.4. where ATI got a nice lead. I've got no data to back this reasoning, but anyway...
It was revealed that DX9 was not working in those drivers. That mean PS1.4. support is probably pretty early in them, and not much work was done on it yet. So it sounds reasonable to say that nVidia has done few optimizations, and it's the area they've focused the less in.
Thus, improvements will mostly be seen in such areas.
As for a slower VS speed than the Radeon 9700 Pro when using complex shading programs. It's hard to say, really. Those programs aren't *so* complex, and bandwidth might be an issue. Also, the GFFX shading architecture is really optimized for very long programs, so I really think those programs might not put it in its best light.
Of course, huge programs won't be seen in games for a while. So those synthetic benchies most certainly show us that the GFFX VS is a little bit inferior in games which are going to release soon.
Finally... The dustbuster, aka Flow FX. Well, the noise really seems bad. But I guess that was known since they announced the whole system. However, I'd like to see lower 2D clock rates. 300/300 is completely useless in 2D! 250/250 is really a maximum, and it would generate less noise. Okay, so you can change that yourself with coolbits. But not everyone is going to do that, and I think it would be a good change.
Also, it would be nice if Flow FX could automatically put the card to something more like 400/400 if it sees FPS is more than, for example, 100. That obviously can't be done via drivers, but it's already on my NV35 wishlist.
Conclusion: On the feature side of things, nVidia is a clear winner. The GFFX is future oriented, and got a lot of cool things for developers. It's unlikely, however, that they will be used in the very short life of the card.
AA Performance is on par with what I expected. AA IQ, however, isn't: I had hoped for a pure MSAA mode with a rotated grid. 4xS performance hit is probably just gonna be too big.
AF Performance is good too, but AF IQ is not. That means AF is really a lot worse than what I had expected.
nVidia got a fast algorithm with the 8x Aggressive mode, but its IQ is on par with ATI's 4x Performance mode and that's the best the adaptive modes can do! Thus, many reviewers choose to use "balanced" and that's why the GFFX is losing so much in all those benchmarks.
The dustbuster is a problem, but it's not huge.
For the GFFX to satisfy what I hoped for, the following should be done ( in increasing order of importance ) :
- Decrease default Flow FX 2D clocks to 250/250 instead of 300/300
- Fix all reported bugs. There seems to be a fair bit of them.
- 5% to 10% performance increase in no AA/AF performance.
- 4x AA mode with pure MSAA and rotated grid sampling. 4xS performance is probably going to be too low.
- AF mode which is a compromise between Aggressive and Balanced. Aggressive is too aggressive, and Balanced performance drop is too huge. We need something between both!
- Send me a NV35 prototype before every other tech site :D j/k
Right now, I'd give the GFFX a mere 65%. If those things were done, I would easily give it 80%, or even 85% if the performance increase they can do via drivers is more like 25%. That is, if I was a reviewer :p
No matter what, I'll probably buy a GFFX if at least the three first things are done. But I might decide not to take the Ultra, not sure if it's really worth the extra money.
Anyway, that's my opinion. As you can see, it's kinda different from most other people's opinion.
I hope you liked reading this. Yes, it's long, and maybe I could have made it shorter. But I really felt like saying all of this, I've been thinking a lot about it. Anyway, thanks for reading!
Uttar