NVIDIA L40S
Unparalleled AI and Graphics Performance for the Data Center
The NVIDIA L40S GPU, based on the Ada Lovelace architecture, is the most powerful universal GPU for the data center, delivering breakthrough multi-workload acceleration for large language model (LLM) inference and training, graphics, and video applications. As the premier platform for multi-modal generative AI, the L40S GPU provides end-to-end acceleration for inference, training, graphics, and video workflows to power the next generation of AI-enabled audio, speech, 2D, video, and 3D applications.
Specifications¹

| Specification | Value |
| --- | --- |
| GPU Architecture | NVIDIA Ada Lovelace Architecture |
| GPU Memory | 48GB GDDR6 with ECC |
| Memory Bandwidth | 864 GB/s |
| CUDA® Cores | 18,176 |
| RT Cores | 142 |
| Tensor Cores | 568 |
| RT Core Performance | 212 TFLOPS |
| FP32 | 91.6 TFLOPS |
| TF32 Tensor Core | 366 TFLOPS |
| BFLOAT16 Tensor Core | 366 / 733² TFLOPS |
| FP16 Tensor Core | 366 / 733² TFLOPS |
| FP8 Tensor Core | 733 / 1,466² TFLOPS |
| Peak INT8 Tensor | 733 / 1,466² TOPS |
| Peak INT4 Tensor | 733 / 1,466² TOPS |
| Form Factor | 4.4" (H) x 10.5" (L), dual slot |
| Display Ports | 4 x DisplayPort 1.4a |
| Max Power Consumption | 350W |
| Power Connector | 16-pin |
| Thermal | Passive |
| Virtual GPU (vGPU) Software Support | Yes |
| NVENC / NVDEC | 3x / 3x (Includes AV1 Encode and Decode) |
| Secure Boot with Root of Trust | Yes |
| NEBS Ready | Level 3 |
| MIG Support | No |
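
The figures in the table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it derives the CUDA-core and Tensor-core counts per RT core, and estimates the clock speed implied by the FP32 figure. That estimate assumes each CUDA core completes one fused multiply-add (2 floating-point operations) per cycle, which this page does not state. The function name `implied_clock_ghz` is hypothetical.

```python
# Figures copied from the specification table.
CUDA_CORES = 18_176
RT_CORES = 142
TENSOR_CORES = 568
FP32_TFLOPS = 91.6


def implied_clock_ghz(fp32_tflops, cuda_cores, flops_per_core_per_cycle=2):
    """Estimate the clock (GHz) implied by a peak FP32 throughput figure.

    Assumption (not from the datasheet): each CUDA core retires one fused
    multiply-add, counted as 2 FLOPs, per clock cycle.
    """
    flops_per_second = fp32_tflops * 1e12
    cycles_per_second = flops_per_second / (cuda_cores * flops_per_core_per_cycle)
    return cycles_per_second / 1e9


cuda_per_rt = CUDA_CORES / RT_CORES      # 128.0
tensor_per_rt = TENSOR_CORES / RT_CORES  # 4.0
clock_ghz = implied_clock_ghz(FP32_TFLOPS, CUDA_CORES)

print(f"CUDA cores per RT core:   {cuda_per_rt:.0f}")
print(f"Tensor cores per RT core: {tensor_per_rt:.0f}")
print(f"Implied clock:            {clock_ghz:.2f} GHz")
```

Under that assumption the implied clock is roughly 2.52 GHz. Treat it as a consistency check on the table, not as a published clock specification.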
Warranty
Free dedicated phone and email technical support (1-800-230-0130)
Dedicated NVIDIA professional products Field Application Engineers
Resources
Contact gopny@pny.com for additional information.