Nvidia GPU Graphics Card Comparison

The GPU (Graphics Processing Unit) plays a central part in machine learning. CPUs just don’t cut it. Hubert Yoshida of Hitachi has described CPUs as designed for a single purpose, such as transaction processing. GPUs, on the other hand, were designed to be multipurpose and can process tasks and functions in parallel.

Google has developed its own accelerator, the Cloud TPU, and Microsoft Azure offers AMD-powered NVv4 instances with GPU partitioning. The giant of the industry, however, is Nvidia.

Renting GPUs from cloud providers is an expensive proposition for many organizations. The good news is that there are plenty of GPU options for building your own AI workstation to train, test, and run machine learning models.

If deep learning is involved, a heftier GPU will be required because deep learning is compute-intensive. Some deep learning models require millions of calculations and parameter updates at run time. For about $2,500, an engineer can acquire a GPU with 4,608 CUDA cores and 576 Tensor Cores. The Tensor Cores speed up large matrix operations and perform “mixed-precision matrix multiply and accumulate calculations in a single operation”.
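The quoted operation, D = A x B + C with half-precision inputs and a single-precision accumulator, can be emulated in plain Python to see why the accumulator's precision matters. This is only a rough sketch and not how Tensor Cores are actually programmed: the helper names (`to_fp16`, `to_fp32`, `mma`) are made up for illustration, and rounding is emulated in software with the standard `struct` module.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision (FP16) value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE single-precision (FP32) value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def mma(a, b, c, accumulate=to_fp32):
    """Return a @ b + c with FP16 inputs; `accumulate` rounds the running sum.

    Hypothetical helper: matrices are plain lists of lists.
    """
    rows, inner, cols = len(a), len(b), len(b[0])
    d = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = accumulate(c[i][j])
            for k in range(inner):
                # Inputs are rounded to FP16; their product fits exactly in FP32.
                product = to_fp16(a[i][k]) * to_fp16(b[k][j])
                acc = accumulate(acc + product)
            d[i][j] = acc
    return d


# Dot product of 4,096 ones, accumulated at two different precisions.
ones_row = [[1.0] * 4096]
ones_col = [[1.0] for _ in range(4096)]
zero = [[0.0]]
print(mma(ones_row, ones_col, zero))                      # FP32 accumulator
print(mma(ones_row, ones_col, zero, accumulate=to_fp16))  # FP16 accumulator
```

With the FP32 accumulator the sum reaches 4096.0. With an FP16 accumulator it stalls at 2048.0, because FP16 cannot represent 2049 and each further addition rounds back down. Keeping the accumulator wider than the inputs is what makes the mixed-precision approach usable for long sums.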

Nvidia calls its latest GPU architecture Turing. It’s the “greatest leap since the invention of CUDA GPU” in 2006, or at least that’s what they say. An interesting feature is real-time ray tracing, which brings 3D environments to life. The Nvidia Titan RTX comes with 24 GB of GDDR6 memory and 576 Tensor Cores, and supports 672 GB/s of memory bandwidth. NVLink also allows cards to be linked together. Below is a comparison of the Titan, GeForce, and Quadro cards.

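| Specs | Titan RTX | GeForce RTX 2080 Ti | GeForce RTX 2080 Super | Quadro RTX 8000 | Quadro RTX 6000 | Quadro RTX 5000 | Quadro GV100 |
|---|---|---|---|---|---|---|---|
| CUDA Cores | 4608 | 4352 | 3072 | 4608 | 4608 | 3072 | 5120 |
| Tensor Cores | 576 | 544 | 384 | 576 | 576 | 384 | 640 |
| TFLOPS (Single Precision) | 16.3 | 13.4 | 11.15 | 16.3 | 16.3 | 11.2 | 14.8 |
| Base Clock | 1350 MHz | 1350 MHz | 1650 MHz | 1395 MHz | 1440 MHz | 1620 MHz | 1132 MHz |
| Boost Clock | 1770 MHz | 1545 MHz | 1815 MHz | 1770 MHz | 1770 MHz | 1815 MHz | 1627 MHz |
| Memory Bandwidth | 672 GB/s | 616 GB/s | 496 GB/s | 672 GB/s | 672 GB/s | 448 GB/s | 868 GB/s |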
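
The single-precision TFLOPS row can be roughly cross-checked against the core counts and boost clocks. The sketch below assumes a common rule of thumb, that each CUDA core retires one fused multiply-add (two floating-point operations) per clock at the boost frequency. That assumption, the `CARDS` dictionary, and the `peak_fp32_tflops` helper are illustrative and not taken from Nvidia's documentation. All figures are copied from the table above.

```python
# (CUDA cores, boost clock in MHz, listed single-precision TFLOPS), from the table.
CARDS = {
    "Titan RTX": (4608, 1770, 16.3),
    "GeForce RTX 2080 Ti": (4352, 1545, 13.4),
    "GeForce RTX 2080 Super": (3072, 1815, 11.15),
    "Quadro RTX 8000": (4608, 1770, 16.3),
    "Quadro RTX 6000": (4608, 1770, 16.3),
    "Quadro RTX 5000": (3072, 1815, 11.2),
    "Quadro GV100": (5120, 1627, 14.8),
}


def peak_fp32_tflops(cuda_cores, boost_mhz):
    """Estimate peak FP32 TFLOPS, assuming one FMA (2 FLOPs) per core per clock."""
    flops_per_second = 2 * cuda_cores * boost_mhz * 1e6
    return flops_per_second / 1e12


for name, (cores, boost_mhz, listed) in CARDS.items():
    estimate = peak_fp32_tflops(cores, boost_mhz)
    print(f"{name:24s} estimated {estimate:6.2f}   listed {listed:6.2f}")
```

Under this assumption the estimate matches the listed figure to within rounding for every card except the Quadro GV100, where the listed boost clock and TFLOPS value do not reconcile.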
