|
|
#13 | |
|
Official pain in the ass
|
Thanks for the input mate.
I'll give this a go whenever I get some free time... which is no time soon ![]()
__________________
My avatar: "An obscure and non sensical pattern made to get people to post what they think it is" - Vamp Dell XPS M1530 Core2Duo T5550 1.86Ghz | 3GB DDR2 | 8600M GT 256MB | 250GB Hdd | BenQ FP202W 20" LCD | MX1000 mouse | Inspire T5400 5.1 |
|
|
|
|
|
|
#14 | |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
After a long night of fighting vs CUDA, i managed to finish the first alpha version of my benchmarking tool.
You can get the tool here: http://www.speedyshare.com/566199380.html Unrar, exec 'run_tests.bat', and you should see something like this: ![]() The best time of my GTX285 in the 256x256 test was 0.024s, and the best time of my 3.6 GHz Xeon 3350 Quad was 0.65s (using the old tool that comes with my br2 patch). So, my GPU is running around 27x times faster than my CPU in this test. Finally, good results. There is still a lot of room for optimizations, so, this is going really well. There are some problems with the FP 'precision'. The ALUs in the GPU do not follow the IEEE FP standards, and there are some errors in the 1024x1024 test. I would like to see your results.
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
|
#15 |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
Well, i was doing something bad, and the results of the test are not valid.
The kernel was not running, because i was selecting bad <<numBlocks, numThreads>>. The only real results for the 1st run are: [128,512] to [32768,8]. So, the GPU is only running 1.25x times faster than the CPU, which is a really bad result. I have a new version, in which i get 2.5x times faster than the CPU. This has been a big disappointment. Note: I have deleted the dl link.
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
#16 | |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
Quote:
You can leech it here: http://www.megaupload.com/?d=Y9M8S5FB
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
|
#17 | |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
I have a new version.
It uses 2 methods: texture fetching / shared memory. Quote:
You can leech it here: http://www.speedyshare.com/455357158.html I have problems to run it on my old G80. If somebody can try it, i would like to know if it works with other cards.
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
|
#18 | |
|
Desperate code
Join Date: Jan 2008
Posts: 790
|
how do you learn this stuff? I have always wanted to learn about software development but never knew where to start
![]()
__________________
If you attack a prostitute, is it rape or merely shoplifting? |
|
|
|
|
|
|
#19 | |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
Quote:
Personally, i started with the BASIC of my Sinclair ZX Spectrum, 25y ago, and i haven't stopped learning yet. Time & patience make the master.
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
|
#20 |
|
Desperate code
Join Date: Jan 2008
Posts: 790
|
Much respect for you dude.
__________________
If you attack a prostitute, is it rape or merely shoplifting? |
|
|
|
|
|
#21 |
|
Registered User
Join Date: May 2006
Posts: 544
|
First start with some VERY basic examples. Good languages to start with are C# and maybe Java, however I do suggest C#. After going through some more difficult C# you can take a look at C++. There are some pretty good tutorials available on how to start.
__________________
Creator of Driver Sweeper |
|
|
|
|
|
#22 | |
|
Registered User
|
Quote:
To start, either go with Java and/or C++ if you want to begin with a OO language, or go with (clean) C. C has the advantage of not letting any wrongdoing go unnoticed. You have to be very careful with memory allocation/deallocation (think explicit garbage collection), otherwise you risk ending up with segfaults really quickly. You have to thoroughly understand pointers and how things are stored in memory, which is always a good thing. And when you're ready, you can delve in more difficult to master techniques, like e.g. pointer arithmetic. If you want to start with C, look for the abundant online tutorials and the K&R book. For C++, the Stroustup can be a little hard in some points, but it's still one of the best books out there. |
|
|
|
|
|
|
#23 | |
|
Rayne
Join Date: Oct 2003
Posts: 1,525
|
A new "beta" version:
http://www.speedyshare.com/366621969.html It will benchmark your CPU vs your GPU. It supports multi-GPU rigs too. You can specify from the command line, the number of GPUs to use. You will need to disable SLI to use multiple GPUs in CUDA, according to nVidia papers. Examples: br2perlin 1 5 -> This will use just 1 GPU br2perlin 2 5 -> This will use 2 GPUs The library also supports mixing the CPU & GPU at the same time. In theory, when i designed it, i thought that CPU+GPU was going to be faster, but, due to the asynchronous nature of CUDA, it ends slower than the CPU or GPU alone. My BR2 Patch is using the new CUDA code now, and the perlin effects run on the GPU now. Unluckily, if you only have 1 gfx card, this is not a good idea, because the framerate is lower due to the resources used for the CUDA calculations. But, if you have 2 gfx cards, you won't lose any fps, and the perlin code will run faster in the GPU (bigger & more complex effects). Basically, i've written this to use my old 8800GTX to run the Perlin effects, and my GTX285 to render the shiny graphics at 1920x1200 SSAA 2x ![]() The results of my Xeon 3350 @ 3.6 GHz + eVGA GTX 285 SSC: Quote:
__________________
ASUS Rampage Formula X48 | Xeon 3350 @ 3.6 GHz (450x8/1.26v) | 4x1GB OCZ DDR2 PC2-6400 Reaper CL3 @ 900 MHz 3-4-4-15 | 1 x eVGA GTX 285 SSC | 1 x ASUS EN8800GTX (PhysX/CUDA -> Burnt by nVidia 196.75 driver) | X-Fi Titanium Fatal1ty PCIe | 1 x Intel X25-M G2 80GB | 2 x 750GB WD RE2 7500AYYS SATA2 16MB | Samsung SH-B083L SATA | Enermax Revolution 1250W | Samsung SyncMaster 275T 27" 1920x1200 | Thermaltake Black Armor | BloodRayne 2 FSAA Patch |
|
|
|
|
|
|
#24 |
|
Official pain in the ass
|
Hey mate, you think you could whip up a simple tutorial on how to work with CUDA?
I know several people here, including myself, would greatly appreciate it. ![]()
__________________
My avatar: "An obscure and non sensical pattern made to get people to post what they think it is" - Vamp Dell XPS M1530 Core2Duo T5550 1.86Ghz | 3GB DDR2 | 8600M GT 256MB | 250GB Hdd | BenQ FP202W 20" LCD | MX1000 mouse | Inspire T5400 5.1 |
|
|
|
![]() |
| Thread Tools | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| New MacBook Pros Make For Great CUDA Dev Platforms | News | Latest Tech And Game Headlines | 0 | 06-15-12 12:40 AM |
| GPU-sphere Swells With 26 New CUDA Centers | News | Latest Tech And Game Headlines | 0 | 06-11-12 01:50 PM |
| Allinea DDT CUDA Education Pack for Student Programmers | News | Latest Tech And Game Headlines | 0 | 05-25-12 08:00 AM |
| Panoptes: A Binary Translation Framework for CUDA | News | Latest Tech And Game Headlines | 0 | 05-22-12 04:50 AM |
| First Achievement Award Bestowed By CUDA Centers of Excellence | News | Latest Tech And Game Headlines | 0 | 05-17-12 07:40 AM |