NVIDIA has launched the DGX SuperPOD, a powerful AI supercomputer designed for trillion-parameter models and generative AI training, featuring a liquid-cooled architecture and 11.5 exaflops of AI supercomputing power.
- The NVIDIA DGX SuperPOD is designed to process trillion-parameter models and support superscale generative AI training and inference workloads.
- The system features a liquid-cooled rack-scale architecture that is highly efficient and provides constant uptime.
- The DGX SuperPOD is equipped with NVIDIA DGX GB200 systems, offering 11.5 exaflops of AI supercomputing power and 240 terabytes of fast memory.
NVIDIA has launched its latest AI supercomputer, the NVIDIA DGX SuperPOD powered by NVIDIA GB200 Grace Blackwell Superchips. The supercomputer is designed to process trillion-parameter models and support superscale generative AI training and inference workloads. The DGX SuperPOD features a liquid-cooled rack-scale architecture that is highly efficient and provides constant uptime. It is built from NVIDIA DGX GB200 systems, offering 11.5 exaflops of AI supercomputing power and 240 terabytes of fast memory. The system can be scaled further with additional racks.
Each DGX GB200 system is equipped with 36 NVIDIA GB200 Superchips, which include 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell GPUs. These Superchips are connected as one supercomputer via fifth-generation NVIDIA NVLink. The GB200 Superchips deliver up to a 30x performance increase compared to the previous generation for large language model inference workloads.
Jensen Huang, founder and CEO of NVIDIA, described the DGX AI supercomputers as the “factories of the AI industrial revolution.” He emphasized that the new DGX SuperPOD combines the latest advancements in NVIDIA accelerated computing, networking, and software to enable companies, industries, and countries to refine and generate their own AI.
The DGX SuperPOD powered by Grace Blackwell features eight or more DGX GB200 systems and can scale to tens of thousands of GB200 Superchips connected via NVIDIA Quantum InfiniBand. To provide a massive shared memory space for next-generation AI models, customers can deploy a configuration that connects the 576 Blackwell GPUs in eight DGX GB200 systems via NVLink.
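The component counts quoted above follow from simple multiplication. The short Python sketch below reproduces them from the per-system figures given in the article; the constant and function names are illustrative only and are not part of any NVIDIA tool.

```python
# Per-system figures as quoted in the article for one DGX GB200 system.
SUPERCHIPS_PER_SYSTEM = 36
GRACE_CPUS_PER_SYSTEM = 36
BLACKWELL_GPUS_PER_SYSTEM = 72


def superpod_totals(num_systems: int) -> dict[str, int]:
    """Return component totals for num_systems DGX GB200 systems."""
    if num_systems < 1:
        raise ValueError("num_systems must be at least 1")
    return {
        "superchips": num_systems * SUPERCHIPS_PER_SYSTEM,
        "grace_cpus": num_systems * GRACE_CPUS_PER_SYSTEM,
        "blackwell_gpus": num_systems * BLACKWELL_GPUS_PER_SYSTEM,
    }


# 72 GPUs across 36 Superchips means two Blackwell GPUs per Superchip.
gpus_per_superchip = BLACKWELL_GPUS_PER_SYSTEM // SUPERCHIPS_PER_SYSTEM

# Eight systems is the smallest SuperPOD configuration the article describes.
eight_system_pod = superpod_totals(8)
print(gpus_per_superchip, eight_system_pod["blackwell_gpus"])
```

For eight systems this gives 288 Superchips and the 576 Blackwell GPUs cited for the NVLink-connected configuration.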
The new DGX SuperPOD architecture features a unified compute fabric that includes fifth-generation NVIDIA NVLink, NVIDIA BlueField-3 DPUs, and support for NVIDIA Quantum-X800 InfiniBand networking. This architecture provides up to 1,800 gigabytes per second of bandwidth to each GPU in the platform. It also incorporates fourth-generation NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP) technology, which offers a 4x increase in In-Network Computing compared to the previous generation.
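To put the per-GPU bandwidth figure in context, the sketch below does two back-of-envelope calculations: the summed bandwidth across a group of GPUs, and the ideal time for one GPU to move a payload at the quoted peak rate. Both treat 1,800 gigabytes per second as a theoretical ceiling; real workloads will see lower throughput, and the function names are hypothetical.

```python
# Peak per-GPU bandwidth quoted in the article, in gigabytes per second.
NVLINK_GB_PER_SEC_PER_GPU = 1_800


def aggregate_bandwidth_gb_per_sec(num_gpus: int) -> int:
    """Sum the quoted per-GPU peak bandwidth over num_gpus GPUs."""
    return num_gpus * NVLINK_GB_PER_SEC_PER_GPU


def ideal_transfer_seconds(payload_gb: float) -> float:
    """Lower bound on the time for one GPU to move payload_gb at the peak rate."""
    return payload_gb / NVLINK_GB_PER_SEC_PER_GPU


# One DGX GB200 system (72 GPUs) and an eight-system configuration (576 GPUs).
print(aggregate_bandwidth_gb_per_sec(72))
print(aggregate_bandwidth_gb_per_sec(576))
print(ideal_transfer_seconds(900))
```

Under these assumptions, 72 GPUs sum to 129,600 gigabytes per second, and a 900-gigabyte payload needs at least half a second on a single GPU's links.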
The DGX SuperPOD is designed as a complete, data-center-scale AI supercomputer that integrates with high-performance storage from NVIDIA-certified partners. It is built, cabled, and tested in the factory to accelerate deployment at customer data centers. The Grace Blackwell-powered DGX SuperPOD features intelligent predictive-management capabilities that continuously monitor hardware and software to predict and intercept sources of downtime and inefficiency. This helps save time, energy, and computing costs.
The software used in the DGX SuperPOD can identify areas of concern, plan for maintenance, adjust compute resources flexibly, and automatically save and resume jobs to prevent downtime. It can even activate standby capacity if a replacement component is needed to ensure work finishes on time. Hardware replacements can be scheduled to avoid unplanned downtime.
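The article does not say how NVIDIA's software implements automatic save and resume. Purely as a generic illustration of the idea, the Python sketch below checkpoints a toy job after every step and picks up from the last saved step after a simulated fault. Everything here (`run_job`, the `fail_at` parameter, the JSON checkpoint file) is hypothetical and unrelated to the actual DGX software.

```python
import json
import os


def run_job(total_steps: int, checkpoint_path: str, fail_at=None) -> int:
    """Sum the step indices 0..total_steps-1, checkpointing after each step.

    If a checkpoint file exists, the job resumes from it instead of
    starting over. fail_at simulates a hardware fault before that step.
    """
    state = {"step": 0, "total": 0}
    if os.path.exists(checkpoint_path):
        with open(checkpoint_path) as f:
            state = json.load(f)  # resume from the last saved step

    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            raise RuntimeError("simulated hardware fault")
        state["total"] += state["step"]  # the "work" for this step
        state["step"] += 1

        # Write to a temporary file, then rename, so that a crash
        # mid-write cannot leave a corrupt checkpoint behind.
        tmp_path = checkpoint_path + ".tmp"
        with open(tmp_path, "w") as f:
            json.dump(state, f)
        os.replace(tmp_path, checkpoint_path)

    return state["total"]
```

A first call with `fail_at=3` stops after three completed steps; calling `run_job` again with the same checkpoint path continues from step 3 rather than repeating the finished work.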
In addition to the DGX SuperPOD, NVIDIA also introduced the DGX B200 system, a unified AI supercomputing platform for AI model training, fine-tuning, and inference. The DGX B200 is the sixth generation of air-cooled, traditional rack-mounted DGX designs and includes eight NVIDIA Blackwell GPUs and two 5th Gen Intel Xeon processors. Customers can build DGX SuperPOD using DGX B200 systems to create AI Centers of Excellence that can support large teams of developers running multiple jobs.
Each DGX B200 system offers up to 144 petaflops of AI performance, 1.4 TB of GPU memory, and 64 TB/s of memory bandwidth. It includes advanced networking with eight NVIDIA ConnectX-7 NICs and two BlueField-3 DPUs, providing up to 400 gigabits per second of bandwidth per connection. The DGX platforms ship with NVIDIA AI Enterprise software, which offers pretrained models, frameworks, toolkits, and microservices to accelerate development and deployment.
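The system-level DGX B200 figures can be broken down per GPU, and the network rate converted from gigabits to gigabytes, with a few lines of Python. This is a rough sketch that assumes resources are split evenly across the eight GPUs and uses decimal units (1 TB = 1,000 GB); the quoted figures are rounded, so the per-GPU results are approximate. The names are illustrative.

```python
# System-level figures quoted in the article for one DGX B200.
NUM_GPUS = 8
AI_PETAFLOPS = 144
GPU_MEMORY_TB = 1.4
MEMORY_BANDWIDTH_TB_PER_SEC = 64
LINK_GIGABITS_PER_SEC = 400


def per_gpu(system_total: float) -> float:
    """Split a system-level figure evenly across the eight GPUs."""
    return system_total / NUM_GPUS


def gigabits_to_gigabytes(gigabits: float) -> float:
    """Convert gigabits to gigabytes (8 bits per byte)."""
    return gigabits / 8


print(per_gpu(AI_PETAFLOPS))                    # petaflops per GPU
print(round(per_gpu(GPU_MEMORY_TB) * 1000))     # GB of memory per GPU
print(per_gpu(MEMORY_BANDWIDTH_TB_PER_SEC))     # TB/s per GPU
print(gigabits_to_gigabytes(LINK_GIGABITS_PER_SEC))  # GB/s per connection
```

On these assumptions each GPU accounts for roughly 18 petaflops, 175 GB of memory, and 8 TB/s of memory bandwidth, while each 400-gigabit network connection carries at most 50 gigabytes per second.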
NVIDIA DGX SuperPOD with DGX GB200 and DGX B200 systems is expected to be available later this year from NVIDIA’s global partners.
Background Information
About Intel:
Intel Corporation, a global technology leader, is known for its semiconductor innovations that power computing and communication devices worldwide. As a pioneer in microprocessor technology, Intel has left an indelible mark on the evolution of computing with processors that drive everything from PCs to data centers and beyond. With a history of advancements, Intel's relentless pursuit of innovation continues to shape the digital landscape, offering solutions that empower businesses and individuals to achieve new levels of productivity and connectivity.
About NVIDIA:
NVIDIA has firmly established itself as a leader in the realm of client computing, continuously pushing the boundaries of innovation in graphics and AI technologies. With a deep commitment to enhancing user experiences, NVIDIA's client computing business focuses on delivering solutions that power everything from gaming and creative workloads to enterprise applications. Known for its GeForce graphics cards, the company has redefined high-performance gaming, setting industry standards for realistic visuals, fluid frame rates, and immersive experiences. Complementing its gaming expertise, NVIDIA's Quadro and NVIDIA RTX graphics cards cater to professionals in design, content creation, and scientific fields, enabling real-time ray tracing and AI-driven workflows that improve productivity and creativity. By integrating graphics, AI, and software, NVIDIA continues to shape the landscape of client computing.
Technology Explained
GPU: GPU stands for Graphics Processing Unit and is a specialized type of processor designed to handle graphics-intensive tasks. It is used in the computer industry to render images, videos, and 3D graphics. GPUs are used in gaming consoles, PCs, and mobile devices to provide a smooth and immersive gaming experience. They are also used in the medical field to create 3D models of organs and tissues, and in the automotive industry to create virtual prototypes of cars. GPUs are also used in the field of artificial intelligence to process large amounts of data and create complex models. GPUs are becoming increasingly important in the computer industry as they are able to process large amounts of data quickly and efficiently.
Grace Blackwell: Grace Blackwell is the name of NVIDIA's GB200 Superchip platform, which pairs NVIDIA Grace CPUs with NVIDIA Blackwell GPUs. It is computing hardware for AI training and inference, not an AI model itself. In a DGX GB200 system, 36 GB200 Superchips provide 36 Grace CPUs and 72 Blackwell GPUs, which works out to one Grace CPU and two Blackwell GPUs per Superchip. The Superchips are linked by fifth-generation NVIDIA NVLink so that they operate as a single supercomputer.
Petaflops: A petaflops is a unit of computing speed equal to one quadrillion (10^15) floating-point operations per second. It is used to describe the performance of supercomputers, which are extremely powerful computers built for complex calculations and simulations. Machines operating at this scale have enabled advances in fields such as weather forecasting, climate modeling, and drug discovery, and they are also used to train and run artificial intelligence and machine learning models. One exaflops equals 1,000 petaflops, so the 11.5 exaflops quoted for the DGX SuperPOD corresponds to 11,500 petaflops.
Xeon: The Intel Xeon processor is a powerful and reliable processor used in many computer systems. It is a multi-core processor that is designed to handle multiple tasks simultaneously. It is used in servers, workstations, and high-end desktop computers. It is also used in many embedded systems, such as routers and switches. The Xeon processor is known for its high performance and scalability, making it a popular choice for many computer applications. It is also used in many cloud computing applications, as it is capable of handling large amounts of data and providing high levels of performance. The Xeon processor is also used in many scientific and engineering applications, as it is capable of handling complex calculations and simulations.