Originally Posted by gio___
FDTD3d, Throughput = 179030.9718 MPoints/s, Time = 0.00002 s, Size = 2820096 Points, NumDevsUsed = 1, Blocksize = 256
...and it reported PASSED at the end? At 4 bytes per point (one single-precision float), that throughput would imply a minimum memory bandwidth of about 716 GB/s. The Tesla C1060 has a peak of 102 GB/s and the gtx60m has 61 (good if we can sustain 10 or so), and the FDTD algorithm is not quite that efficient. Something's not quite right...
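For anyone who wants to redo the arithmetic, here is a rough sketch in Python. It assumes each point costs at least one 4-byte float of memory traffic, which is my assumption for a lower bound and not something the sample prints; the function names are made up for illustration.

```python
# Back-of-the-envelope check of the reported FDTD3d numbers.
# Assumption: at least one single-precision float (4 bytes) is moved
# per point, so the result is a lower bound on memory traffic.
BYTES_PER_POINT = 4


def implied_bandwidth_gb_s(mpoints_per_s, bytes_per_point=BYTES_PER_POINT):
    """Minimum memory bandwidth in GB/s implied by a throughput in MPoints/s."""
    return mpoints_per_s * 1e6 * bytes_per_point / 1e9


def implied_time_s(points, mpoints_per_s):
    """Run time in seconds implied by a problem size and a throughput."""
    return points / (mpoints_per_s * 1e6)


# Values copied from the quoted output line.
reported_mpoints_per_s = 179030.9718
reported_points = 2820096

bandwidth = implied_bandwidth_gb_s(reported_mpoints_per_s)
elapsed = implied_time_s(reported_points, reported_mpoints_per_s)

print(f"implied bandwidth: {bandwidth:.0f} GB/s")
print(f"implied run time:  {elapsed * 1e6:.1f} microseconds")
```

The implied run time comes out at roughly 16 microseconds for about 2.8 million points, which matches the rounded 0.00002 s in the output. A time that short looks more like a measurement that did not cover the real kernel work than like a genuine result.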