12.07.2015 Views

Tesla Kepler Family Product Overview - Nvidia

Tesla Kepler Family Product Overview - Nvidia

Tesla Kepler Family Product Overview - Nvidia

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Tesla</strong> ® <strong>Kepler</strong> GPU AcceleratorsIntroducing the world’s fastest accelerators.NVIDIA <strong>Tesla</strong> K-series GPU Accelerators are based on the NVIDIA <strong>Kepler</strong> compute architecture and powered by CUDA ® , the world’smost pervasive parallel computing model. They include innovative technologies like Dynamic Parallelism and Hyper-Q to boostperformance as well as power efficiency and deliver record application speeds for seismic processing, biochemistry simulations,weather and climate modeling, image, video and signal processing, computational finance, computational physics, CAE, CFD, and dataanalytics.The innovative <strong>Kepler</strong> computearchitecture design includes:SMX (streaming multiprocessor) designthat delivers up to 3x more performanceper watt compared to the SM inFermi 1 . It also delivers one petaflop ofcomputing in just ten server racks.Dynamic Parallelism capability thatenables GPU threads to automaticallyspawn new threads. By adapting to thedata without going back to the CPU, itgreatly simplifies parallel programming.Plus it enables GPU acceleration of abroader set of popular algorithms, likeadaptive mesh refinement (AMR), fastmultipole method (FMM), and multigridmethods.Hyper-Q feature that enables multipleCPU cores to simultaneously utilize theCUDA cores on a single <strong>Kepler</strong> GPU. Thisdramatically increases GPU utilization,slashes CPU idle times, and advancesprogrammability—ideal for clusterapplications that use MPI.The <strong>Tesla</strong> K-series family of products includes:<strong>Tesla</strong> K10 GPU Accelerator – Optimized for singleprecision applications, the <strong>Tesla</strong> K10 includes twoultra-efficient GK104 <strong>Kepler</strong> GPUs to deliver highthroughput. It delivers up to 2x the performancefor single precision applications compared to theprevious generation <strong>Tesla</strong> M2090 GPU in the samepower envelope. With an aggregate performance of4.58 teraflop peak single precision and 320 gigabytesper second memory bandwidth for both GPUs puttogether, the <strong>Tesla</strong> K10 is optimized for computationsin seismic, signal image processing, and videoanalytics.<strong>Tesla</strong> K20 and K20X GPU Accelerators – Designedto be the performance leader in double precisionapplications and the broader supercomputing market,the <strong>Tesla</strong> K20 and K20X GPU Accelerators deliver 10xthe performance of a single CPU 2 . <strong>Tesla</strong> K20 and K20Xboth feature a single GK110 <strong>Kepler</strong> GPU that includesthe Dynamic Parallelism and Hyper-Q features.With more than one teraflop peak double precisionperformance, these GPU accelerators are ideal forthe most aggressive high-performance computingworkloads including climate and weather modeling,CFD, CAE, computational physics, biochemistrysimulations, and computational finance.1Based on DGEMM performance: <strong>Tesla</strong> M2090 = 410 gigaflops, <strong>Tesla</strong> K20 (expected) > 1000 gigaflops2Based on WS-LSMS performance comparison between single E5-2687W @ 3.10GHz vs single <strong>Tesla</strong> K20X. <strong>Tesla</strong> K20X > 650 gigaflopsNVIDIA <strong>Tesla</strong> K-Series | Datasheet | Oct12

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!