NVIDIA DGX Spark™ is part of a new class of computers designed to build and run AI, enabling developers to locally prototype, fine-tune, and inference large AI models.
NVIDIA DGX Spark™ is part of a new class of computers designed to build and run AI, delivering up to 1 petaflop of AI performance from a power-efficient, compact form factor. With the preinstalled NVIDIA AI software stack and 128 GB of memory, developers can locally prototype, fine-tune, and inference large AI models of up to 200 billion parameters and 405 billion parameters when connecting two DGX Spark’s locally and use NVIDIA NemoClaw to build and run autonomous agents safely.
From novice to expert, leverage NVIDIA’s curated playbooks to get off to a running start on various projects. Playbooks contain step-by-step recipes to demonstrate the art of the possible.
A cloud-native suite of software tools, libraries, and frameworks accelerating production-grade AI development. Combines enterprise-grade security, optimized performance, and enterprise-level support to streamline prototype-to-production for next-gen agentic AI.
NVIDIA GPU, CPU, Networking, and AI Software Technologies
Experience up to 1 PFLOP of AI performance at FP4 precision with the NVIDIA Grace Blackwell architecture.
Run AI development and testing workloads with AI models up to 200 billion parameters at your desktop with large, unified system memory.
High-performance NVIDIA Connect-X networking enables connecting two NVIDIA DGX Spark systems together to work with AI models up to 405 billion parameters.
Use a full-stack solution for generative AI workloads, including tools, frameworks, libraries, and pretrained models.
*Preliminary specifications, subject to change1Theoretical FP4 TFLOPS using the sparsity feature.2 TDP: Thermal Design Power of the GB10 chip including CPU and GPU
The NVIDIA Grace Blackwell architecture delivers a powerful system-on-a-chip (SoC) solution combining NVIDIA Blackwell GPU technology with Arm CPU cores to deliver supercomputer level performance in an efficient, compact SoC.
The GB10 provides 20 Arm cores, 10 Cortex-X925 cores and 10 Cortex-A725 cores.
The GB10 Blackwell GPU provides 6144 CUDA Cores to accelerate graphics and FP32 compute workloads, 5th generation Tensor cores that provide up to 1,000 AI TOPS1 1Theoretical FP4 TOPS performance using sparsity feature of FP4 performance, and 4th generation RT cores that provide real time ray tracing acceleration. The GPU also includes 5th generation NVDEC and 9th generation NVENC engines for accelerated encoding and decoding of MPEG-2, VC-1, H.264 (AVCHD), H.265 (HEVC), VP8, VP9, and AV1 video formats, including support for 4:2:2 H.264 and H.265 video.
The DGX Spark features 128GB of coherent, unified LPDDR5x system memory. Accessible to both CPU and GPU, DGX Spark enables users to work with large workloads including AI models of up to 200 billion parameters.
Equipped with a ConnectX-7 Smart NIC, the DGX Spark provides up to 200 gigabit ethernet (GbE). Connecting two DGX Spark systems via the ConnectX-7 networking port enables users to work with AI models of up to 405 billion parameters.
With the NVIDIA DGX OS pre-installed, the DGX Spark development environment will be familiar to DGX developers. Including Ubuntu Linux, the DGX OS system software makes it easy to move work to DGX Cloud or any accelerated data center or cloud infrastructure. Installed software also includes RTX toolkits and libraries, supporting NVIDIA RTX visualization technologies.
In addition to the DGX OS, DGX Spark includes the NVIDIA AI software stack which includes CUDA, CUDA-X toolkits and libraries to enable AI developers to quickly bring up their AI projects and workflows. Users can install their favorite AI frameworks, SDKs, and services such as PyTorch, TensorFlow, or Jupyter Notebooks or take advantage of NVIDIA platforms and frameworks such as NVIDIA Isaac Sim™, NVIDIA Metropolis, or NVIDIA Holoscan and NVIDIA NIMs and AI Blueprint example workflows.
The NVIDIA DGX Spark is a compact desktop system, measuring 150 mm L x 150 mm W x 50.5 mm H, able to fit onto any desktop with minimal space required. The system is designed to look like the original DGX 1 system, the original AI supercomputer that was first delivered to customers in 2016, making it an attractive addition to any workspace.
Sign up to receive exclusive updates, virtual events, and insights on NVIDIA IGX Orin and advancements in Edge AI.
Sign Up
Contact gopny@pny.com for additional information.