NVIDIA L40

NVIDIA^® L40

SKU: NVL40TCGPU-KIT

Description

Certification Request

EOL Notification Form EOL Notification Form

NVIDIA L40

Unprecedented Visual Computing Performance for the Data Center

From virtual workstations to large-scale modeling and simulation, modern visual computing and scientific applications are growing in complexity and quantity. Enterprises need data center solutions that can deliver extreme performance and scale with versatile capabilities to conquer the diverse computing demands of these increasingly complex workloads.

The NVIDIA L40 delivers unprecedented visual computing performance for the data center, providing next-generation graphics, compute, and AI capabilities to GPU-accelerated applications. Based on the Ada Lovelace GPU architecture, the L40 features third-generation RT Cores that enhance real-time ray tracing capabilities, and fourth-generation Tensor Cores with support for the FP8 data format to deliver over a petaflop of inferencing performance. These new features are combined with the latest generation CUDA Cores and 48GB of graphics memory to accelerate visual computing workload from high-performance virtual workstation instances to large-scale digital twins in NVIDIA Omniverse. With up to twice the performance of the previous generation at the same power, the NVIDIA L40 is uniquely suited to provide the visual computing power required by the modern data center.

Performance Highlights
CUDA Cores	18176
Tensor Cores	568 \| Gen 4
RT Cores	142 \| Gen 3
GPU Memory	48 GB GDDR6 ECC
Memory Interface	384-bit
Memory Bandwidth	864 GB/s
Display Connectors	4x DP 1.4a
Maximum Digital Resolution	4x 5K at 60 Hz \| 2x 8K at 60 Hz 4x 4K at 120 Hz \| 30-bit Color
Form Factor	4.4" H x 10.5" L \| Dual Slot
Thermal Solution	Passive
Maximum Power Consumption	300 W

Scalable multi-workload performance

Next Generation Graphics

The NVIDIA L40 brings the highest level of power and performance for visual computing workloads in the data center. Third-generation RT Cores and industry-leading 48 GB of GDDR6 memory deliver up to twice the real-time ray tracing performance of the previous generation to accelerate high-fidelity creative workflows, including real-time, full-fidelity, interactive rendering, 3D design, video streaming, and virtual production — ideal for demanding Media and Entertainment professionals.

Computing Demonstrated on Laptops and GPUs

Powerful Compute and AI

Groundbreaking features in the NVIDIA L40 accelerate a wide range of compute-intensive workloads running in the data center, including training, inferencing, data science, and graphics applications. The latest fourth-generation Tensor Cores deliver enhanced AI capabilities to accelerate visual computing workloads and deliver groundbreaking performance for deep learning and inference applications.

Data Center-Ready

Designed for 24 x 7 enterprise data center operations and optimized to deploy at scale, the NVIDIA L40 utilizes enterprise-grade components, power-efficient hardware, and features like secure boot with internal root of trust. The L40 delivers the highest levels of performance and reliability for data center workloads. Packaged in a dual-slot power-efficient design, the L40 is available in a wide variety of NVIDIA-Certified Systems from leading system builders.

Structural Sparsity

AI networks are big, having millions to billions of parameters. Not all of these parameters are needed for accurate predictions, and some can be converted to zeros to make the models “sparse" without compromising accuracy. Tensor Cores in the NVIDIA L40 can provide up to 2x higher performance for sparse models. While the sparsity feature more readily benefits AI inference, it can also improve the performance of model training.

Every Deep Learning Framework, 700+ GPU-Accelerated Applications

The NVIDIA L40 GPU is a key component of the NVIDIA data center platform for deep learning, HPC, and data analytics. It accelerates every major deep learning framework and accelerates over 700 HPC applications. It's available everywhere, from desktops to servers to cloud services, delivering both dramatic performance gains and cost-saving opportunities.

Virtualization Capabilities

Virtualized compute workloads such as AI, Deep learning, FP32 high-performance computing (HPC), and photorealistic ray-traced graphics, even essential corporate applications and general-purpose productivity software (90% of which utilizes GPU acceleration) can be supported with NVIDIA vApps (vApps), NVIDIA vPC (vPC), and NVIDIA RTX Virtual Workstation (vWS).

Warranty

3-Year Limited Warranty

Free dedicated phone and email technical support
(1-800-230-0130)

Dedicated NVIDIA professional products Field Application Engineers

Resources

Contact gopny@pny.com for additional information.

Features

NVIDIA L40

BUILT TO PERFORM, WHATEVER THE DEMAND

NVIDIA Omniverse Enterprise

As the engine of NVIDIA^® Omniverse™ Enterprise in the data center, the NVIDIA L40 brings powerful RTX and AI capabilities to power workloads like extended (XR) and virtual reality (VR) applications, design collaboration, and digital twins. For the most complex Omniverse workloads, the NVIDIA L40 enables accelerated ray-traced and path-traced rendering of materials, physically-accurate simulations, and generating photorealistic 3D synthetic data.

Rendering and 3D Graphics

Running professional 3D visualization applications with NVIDIA L40 enables creative professionals to iterate more, render faster, and unlock tremendous performance advantages that increase productivity and speed up project completion. Artists and designers can work in real-time with complex geometry and high-resolution textures to generate photorealistic designs and simulation to power high-fidelity creative workflows.

High-Performance Virtual Workstations

When combined with NVIDIA RTX™ Virtual Workstation (vWS) software (support coming in early 2023), the NVIDIA L40 delivers powerful virtual workstations from the data center or cloud to any device. Millions of creative and technical professionals can access the most demanding applications from anywhere with awe-inspiring performance that rivals physical workstations — all while meeting the need for greater security.

AI Training and Data Science

Powerful training and inference performance, combined with enterprise-class stability and reliability, make NVIDIA L40 the ideal platform for single-GPU AI training and development. The NVIDIA L40 reduces the time to completion for model training and development and data science data preparation workflows by delivering higher throughput and support for a full range of precisions, including FP8.

Streaming and Video Content

The NVIDIA L40 takes streaming and video content workloads to the next level with three video encode and three video decode engines. With the addition of AV1 encoding, the L40 delivers breakthrough performance and improved TCO for broadcast streaming, video production, and transcription workflows.

Specifications

NVIDIA L40

SPECIFICATIONS

Product	NVIDIA L40
Architecture	NVIDIA Ada Lovelace Architecture
Process Size	4nm NVIDIA Custom Process \| TSMC
Transistors	76.3 Billion
Die Size	608.44 mm²
CUDA Cores	18176
Tensor Cores	568 \| Gen 4
RT Cores	142 \| Gen 3
GPU Memory	48 GB GDDR6 ECC
Memory Interface	384-bit
Memory Bandwidth	864 GB/s
Display Connectors	4x DP 1.4a
Maximum Digital Resolution	4x 5K at 60 Hz \| 2x 8K at 60 Hz 4x 4K at 120 Hz \| 30-bit Color
Form Factor	4.4" H x 10.5" L \| Dual Slot
Thermal Solution	Passive
Maximum Power Consumption	300 W
vGPU Software Support	NVIDIA vApps, vPC, vWS \| Early 2023
vGPU Profiles Supported	1 GB, 2 GB, 3 GB, 4 GB, 6 GB, 8 GB, 12 GB, 16 GB, 24 GB, 48 GB
Graphics APIs	DirectX 12 Ultimate, Shader Model 6.6, OpenGL 4.6, Vulkan 1.3
NVENC \| NVDEC	3x ENC \| 3x DEC \| Includes AV1 Encode and Decode
Compute APIs	CUDA 12.0, DirectCompute, OpenCL 3.0
NVIDIA 3D Vision and 3D Vision Pro	Support via Optional 3-pin mini-DIN Bracket
Frame Lock	Supported with optional NVIDIA Quadro Sync II
Power Connector	1x PCIe CEM5 16-pin
NEBS Ready	Level 3
Secure Boot with Root of Trust	Supported

Preliminary specifications only (subject to change).

SUPPORTED OPERATING SYSTEMS

Windows Server 2012 R2
Windows Server 2016 1607, 1709
Windows Server 2019
RedHat CoreOS 4.7
Red Hat Enterprise Linux 8.1-8.3

Red Hat Enterprise Linux 7.7-7.9
Red Hat Linux 6.6+
SUSE Linux Enterprise Server 15 SP2
SUSE Linux Enterprise Server 12 SP 3+
Ubuntu 14.04 LTS/16.04/18.04 LTS/20.04 LTS

WARRANTY

3-Year Limited Warranty
Free dedicated phone and email technical support (1.800.230.0130)

Dedicated NVIDIA professional products Field Application Engineers

RESOURCES

Product Brochure

Product Brief

PACKAGE CONTAINS

NVIDIA L40 PCIe Board

Auxiliary power cable

NVIDIA L40

NVIDIA® L40

NVIDIA L40

Unprecedented Visual Computing Performance for the Data Center

Performance Highlights

CUDA Cores

Tensor Cores

RT Cores

GPU Memory

Memory Interface

Memory Bandwidth

Display Connectors

Maximum Digital Resolution

Form Factor

Thermal Solution

Maximum Power Consumption

Scalable multi-workload performance

Next Generation Graphics

Powerful Compute and AI

Data Center-Ready

Structural Sparsity

Every Deep Learning Framework, 700+ GPU-Accelerated Applications

Virtualization Capabilities

Warranty

Resources

NVIDIA L40

BUILT TO PERFORM, WHATEVER THE DEMAND

NVIDIA Omniverse Enterprise

Rendering and 3D Graphics

High-Performance Virtual Workstations

AI Training and Data Science

Streaming and Video Content

NVIDIA L40

SPECIFICATIONS

Product

Architecture

Process Size

Transistors

Die Size

CUDA Cores

Tensor Cores

RT Cores

GPU Memory

Memory Interface

Memory Bandwidth

Display Connectors

Maximum Digital Resolution

Form Factor

Thermal Solution

Maximum Power Consumption

vGPU Software Support

vGPU Profiles Supported

Graphics APIs

NVENC | NVDEC

Compute APIs

NVIDIA 3D Vision and 3D Vision Pro

Frame Lock

Power Connector

NEBS Ready

Secure Boot with Root of Trust

SUPPORTED OPERATING SYSTEMS

WARRANTY

RESOURCES

PACKAGE CONTAINS

Related Products

NVIDIA L40S

NVIDIA A40

0 item added to cart

NVIDIA^® L40