NVIDIA Kepler GK110 Architecture White Paper

05-20-12, 09:00 PM
NVIDIA Kepler GK110 Die Shot (http://gpgpu.org/wp/wp-content/uploads/2012/05/nvidia_kepler2_die_shot.jpg)

This white paper (http://www.nvidia.com/content/PDF/kepler/NVIDIA-Kepler-GK110-Architecture-Whitepaper.pdf) describes the new Kepler GK110 Architecture from NVIDIA.

Comprising 7.1 billion transistors, Kepler GK110 is not only the fastest, but also the most architecturally complex microprocessor ever built. Adding many new innovative features focused on compute performance, GK110 was designed to be a parallel processing powerhouse for Tesla® and the HPC market.

Kepler GK110 will provide over 1 TFlop of double precision throughput with greater than 80% DGEMM efficiency versus 60–65% on the prior Fermi architecture.
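To put the efficiency figures in perspective: DGEMM efficiency is usually understood as the sustained double precision matrix-multiply rate divided by the chip's theoretical peak. The short Python sketch below works through that arithmetic. The 1.0 TFlop peak is a hypothetical round number applied to both cases purely for illustration (it is not the actual peak of either chip), and the helper name `sustained_tflops` is likewise made up for this example.

```python
def sustained_tflops(peak_tflops: float, efficiency: float) -> float:
    """Sustained DGEMM throughput = theoretical peak x DGEMM efficiency."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be a fraction between 0 and 1")
    return peak_tflops * efficiency


# ASSUMPTION: hypothetical peak, chosen only to keep the arithmetic simple.
PEAK_TFLOPS = 1.0

kepler = sustained_tflops(PEAK_TFLOPS, 0.80)      # ">80%" quoted for GK110
fermi_low = sustained_tflops(PEAK_TFLOPS, 0.60)   # low end of Fermi's range
fermi_high = sustained_tflops(PEAK_TFLOPS, 0.65)  # high end of Fermi's range

# Gain attributable to efficiency alone, at equal peak throughput.
gain_low = kepler / fermi_high
gain_high = kepler / fermi_low

print(f"80% efficiency: {kepler:.2f} TFlop sustained")
print(f"60-65% efficiency: {fermi_low:.2f} to {fermi_high:.2f} TFlop sustained")
print(f"Efficiency-only gain: {gain_low:.2f}x to {gain_high:.2f}x")
```

Under these assumptions, the efficiency improvement by itself is worth roughly 1.2x to 1.3x in sustained DGEMM throughput; any increase in peak throughput between the two architectures multiplies on top of that.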

In addition to greatly improved performance, the Kepler architecture offers a huge leap forward in power efficiency, delivering up to 3x the performance per watt of Fermi.

The paper describes features of the Kepler GK110 architecture, including:

Dynamic Parallelism;
Grid Management Unit;
New SHFL instruction and atomic instruction enhancements;
New read-only data cache previously only accessible to texture;
Bindless Textures;
and much more.

