640 TENSOR CORES An Exponential Leap in Performance Every industry needs AI, and with this massive leap forward in speed, AI can now be applied to every industry. Equipped with 640 Tensor Cores, Volta delivers over 100 Teraflops per second (TFLOPS) of deep learning performance, over a 5X increase compared to prior generation NVIDIA Pascal architecture.
NEW GPU ARCHITECTURE Engineered for the Modern Computer Humanity’s greatest challenges will require the most powerful computing engine for both computational and data science. With over 21 billion transistors, Volta is the most powerful GPU architecture the world has ever seen. It pairs NVIDIA CUDA and Tensor Cores to deliver the performance of an AI supercomputer in a GPU.
NEXT GENERATION NVLINK Scalability for Rapid Time-to-Solution Volta uses next generation revolutionary NVIDIA NVLink high-speed interconnect technology. This delivers 2X the throughput, compared to the previous generation of NVLink. This enables more advanced model and data parallel approaches for strong scaling to achieve the absolute highest application performance.
VOLTA-OPTIMIZED SOFTWARE GPU-Accelerated Frameworks and Applications Data scientists are often forced to make trade-offs between model accuracy and longer run-times. With Volta-optimized CUDA and NVIDIA Deep Learning SDK libraries like cuDNN, NCCL, and TensorRT, the industry’s top frameworks and applications can easily tap into the power of Volta. This propels data scientists and researchers towards discoveries faster than before.