View Single Post
Old 02-26-09, 04:16 PM   #17
walterman
Rayne
 
walterman's Avatar
 
Join Date: Oct 2003
Posts: 1,525
Post Re: Anybody into CUDA ?

I have a new version.

It uses 2 methods: texture fetching / shared memory.

Quote:
BloodRayne 2 FSAA Patch - CUDA Perlin Benchmark Tool 0.15 Alpha
---------------------------------------------------------------

Running Benchmarks ...
----------------------
TF [128, 512] Total Time: 0.192191s
SM [128, 512] Total Time: 0.107221s
TF [256, 256] Total Time: 0.191223s
SM [256, 256] Total Time: 0.103024s
TF [512, 128] Total Time: 0.190813s
SM [512, 128] Total Time: 0.126796s
TF [1024, 64] Total Time: 0.189704s
SM [1024, 64] Total Time: 0.189470s
TF [2048, 32] Total Time: 0.189634s
SM [2048, 32] Total Time: 0.390741s
TF [4096, 16] Total Time: 0.198291s
SM [4096, 16] Total Time: 0.942939s
TF [8192, 8] Total Time: 0.255677s
SM [8192, 8] Total Time: 2.238887s
TF [16384, 4] Total Time: 0.435167s
SM [16384, 4] Total Time: 6.500768s
TF [32768, 2] Total Time: 0.856746s
SM [32768, 2] Total Time: 22.913676s

Best Config (Shared Memory) [256, 256]: 0.103024s

Running Verification Test at (Shared Memory) [256, 256] ...
------------------------------------------------------------
Everything OK

BloodRayne 2 FSAA Patch - CUDA Perlin Benchmark Tool 0.15 Alpha
---------------------------------------------------------------

Running Benchmarks ...
----------------------
TF [512, 512] Total Time: 0.697142s
SM [512, 512] Total Time: 0.370065s
TF [1024, 256] Total Time: 0.692771s
SM [1024, 256] Total Time: 0.374493s
TF [2048, 128] Total Time: 0.690623s
SM [2048, 128] Total Time: 0.464357s
TF [4096, 64] Total Time: 0.688960s
SM [4096, 64] Total Time: 0.712639s
TF [8192, 32] Total Time: 0.690871s
SM [8192, 32] Total Time: 1.504626s
TF [16384, 16] Total Time: 0.702908s
SM [16384, 16] Total Time: 3.673776s
TF [32768, 8] Total Time: 0.974903s
SM [32768, 8] Total Time: 8.863379s

Best Config (Shared Memory) [512, 512]: 0.370065s

Running Verification Test at (Shared Memory) [512, 512] ...
------------------------------------------------------------
Everything OK

BloodRayne 2 FSAA Patch - CUDA Perlin Benchmark Tool 0.15 Alpha
---------------------------------------------------------------

Running Benchmarks ...
----------------------
TF [2048, 512] Total Time: 2.361321s
SM [2048, 512] Total Time: 1.282897s
TF [4096, 256] Total Time: 2.369100s
SM [4096, 256] Total Time: 1.336673s
TF [8192, 128] Total Time: 2.361914s
SM [8192, 128] Total Time: 1.677807s
TF [16384, 64] Total Time: 2.360700s
SM [16384, 64] Total Time: 2.639611s
TF [32768, 32] Total Time: 2.360139s
SM [32768, 32] Total Time: 5.692210s

Best Config (Shared Memory) [2048, 512]: 1.282897s

Running Verification Test at (Shared Memory) [2048, 512] ...
------------------------------------------------------------
Everything OK
It's 6.5x times faster than the CPU. It will be hard to make it faster.

You can leech it here: http://www.speedyshare.com/455357158.html

I have problems to run it on my old G80. If somebody can try it, i would like to know if it works with other cards.
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch
walterman is offline   Reply With Quote