09.05.2023 Views

pdfcoffee

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Tensor Processing Unit

If you want to have a look at TPU performance compared to GPUs and CPUs, we

can refer Jouppi et al., 2014 [3] and see (in a log-log scale graph) that the performance

is two orders of magnitude higher than a Tesla K80 GPU.

The graph shows a "rooftop" performance that is growing until the

point where it reaches the peak and then it is constant. The higher

the roof the merrier for performance.

Figure 4: TPU v1 peak performance can be up to 3x higher than a Tesla K80

Second-generation TPU

The second-generation TPUs (TPU2) were announced in 2017. In this case, the

memory bandwidth is increased to 600 GB/s and performance reaches 45 TFLOPS.

4 TPU2s are arranged in a module with 180 TFLOPS performance. Then 64 modules

are grouped into a pod with 11.5 PFLOPS of performance. TPU2s adopt floatingpoint

arithmetic and therefore they are suitable for both training and inference.

[ 576 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!