Intel and Ohio Supercomputer Center Boost AI Processing Power with HPC Cluster


February 21, 2024

Intel, Dell Technologies, Nvidia, and the Ohio Supercomputer Center have joined forces to unveil Cardinal, a state-of-the-art high-performance computing cluster designed to revolutionize research, education, and industry innovation in Ohio, particularly in the field of artificial intelligence.

1. hardware: The Cardinal Cluster is equipped with state-of-the-art hardware, including Dell PowerEdge servers and Intel Xeon CPU Max Series with high bandwidth memory (HBM), making it a powerful and efficient system for handling AI workloads. 2. Impressive specifications: With 756 Max Series CPU 9470 processors, 39,312 CPU cores, and 94 GB of HBM2E memory, the Cardinal Cluster boasts impressive specifications that double OSC's capabilities and cater to a wide range of use cases. 3. Collaboration between industry leaders: The collaboration between Intel, Dell Technologies, Nvidia, and the Ohio Supercomputer Center showcases the importance of HPC resources in driving innovation and advancing scientific breakthroughs. This partnership brings together the expertise and resources of these industry leaders to create a HPC cluster.


Intel, Dell Technologies, nVidia, and the Ohio Supercomputer Center (OSC) have joined forces to unveil Cardinal, a state-of-the-art high-performance computing (HPC) cluster. Designed to cater to the growing demand for HPC resources in Ohio, particularly in the field of artificial intelligence (AI), Cardinal is set to revolutionize research, education, and industry innovation.

AI and machine learning have become indispensable tools in scientific, engineering, and biomedical domains, enabling researchers to tackle complex questions. As the efficacy of these technologies becomes increasingly evident, academic disciplines like agricultural sciences, architecture, and social studies are embracing their potential. Cardinal has been equipped with hardware that can handle the expanding workloads of AI. This new cluster represents a significant upgrade from its predecessor, the Owens Cluster, which was launched back in 2016.

The Cardinal Cluster is a heterogeneous system that boasts Dell PowerEdge servers and the Intel Xeon CPU Max Series with high bandwidth memory (HBM). This powerful combination efficiently manages memory-bound HPC and AI workloads while promoting programmability, portability, and ecosystem adoption. The system boasts impressive specifications, including 756 Max Series CPU 9470 processors, providing a total of 39,312 CPU cores. Each node is equipped with 128 gigabytes (GB) of HBM2E memory and 512 GB of DDR5 memory. With a unified software stack and traditional programming models on the x86 base, the cluster not only doubles OSC’s capabilities but also caters to a wide range of use cases while ensuring easy adoption and deployment.

In addition to its impressive specifications, the Cardinal Cluster features thirty-two nodes with 104 cores, 1 terabyte (TB) of memory, and four Nvidia Hopper architecture-based H100 Tensor Core GPUs. These nodes are interconnected by four NVLink connections and boast 94 GB of HBM2E memory. The system also utilizes the NVIDIA Quantum-2 InfiniBand, which offers 400 gigabits per second (Gbps) of networking performance with low Latency. This enables the cluster to deliver a staggering 500 Petaflops of peak AI performance (FP8 Tensor Core, with sparsity) for large AI-driven scientific applications. Additionally, the cluster includes sixteen nodes with 104 cores, 128 GB HBM2E memory, and 2 TB DDR5 memory, specifically designed for large symmetric multiprocessing (SMP) style jobs.

“The Intel Xeon CPU Max Series is the perfect choice for developing and implementing HPC and AI workloads, leveraging the most widely adopted AI frameworks and libraries,” stated Ogi Brkic, vice president and general manager of Data Center AI Solutions product line at Intel. “The inherent heterogeneity of this system will empower OSC’s engineers, researchers, and scientists, enabling them to fully exploit its doubled memory bandwidth performance. We are proud to support OSC and our ecosystem with solutions that significantly accelerate the analysis of existing and future data in their targeted focus areas.”

With its exceptional performance capabilities and specifications, the Cardinal Cluster is set to drive innovation in AI research and development across Ohio. This collaborative effort between Intel, Dell Technologies, Nvidia, and the Ohio Supercomputer Center is a testament to the growing significance of HPC resources in advancing scientific breakthroughs and fostering technological advancements.

Intel and Ohio Supercomputer Center Boost AI Processing Power with HPC Cluster

Intel and Ohio Supercomputer Center Boost AI Processing Power with HPC Cluster

(Source)

Background Information


About Dell: Dell is a globally technology leader providing comprehensive solutions in the field of hardware, software, and services. for its customizable computers and enterprise solutions, Dell offers a diverse range of laptops, desktops, servers, and networking equipment. With a commitment to innovation and customer satisfaction, Dell caters to a wide range of consumer and business needs, making it a important player in the tech industry.

Dell website  Dell LinkedIn

About Intel: Intel Corporation, a global technology leader, is for its semiconductor innovations that power computing and communication devices worldwide. As a pioneer in microprocessor technology, Intel has left an indelible mark on the evolution of computing with its processors that drive everything from PCs to data centers and beyond. With a history of advancements, Intel's relentless pursuit of innovation continues to shape the digital landscape, offering solutions that empower businesses and individuals to achieve new levels of productivity and connectivity.

Intel website  Intel LinkedIn

About nVidia: NVIDIA has firmly established itself as a leader in the realm of client computing, continuously pushing the boundaries of innovation in graphics and AI technologies. With a deep commitment to enhancing user experiences, NVIDIA's client computing business focuses on delivering solutions that power everything from gaming and creative workloads to enterprise applications. for its GeForce graphics cards, the company has redefined high-performance gaming, setting industry standards for realistic visuals, fluid frame rates, and immersive experiences. Complementing its gaming expertise, NVIDIA's Quadro and NVIDIA RTX graphics cards cater to professionals in design, content creation, and scientific fields, enabling real-time ray tracing and AI-driven workflows that elevate productivity and creativity to unprecedented heights. By seamlessly integrating graphics, AI, and software, NVIDIA continues to shape the landscape of client computing, fostering innovation and immersive interactions in a rapidly evolving digital world.

nVidia website  nVidia LinkedIn

Technology Explained


CPU: The Central Processing Unit (CPU) is the brain of a computer, responsible for executing instructions and performing calculations. It is the most important component of a computer system, as it is responsible for controlling all other components. CPUs are used in a wide range of applications, from desktop computers to mobile devices, gaming consoles, and even supercomputers. CPUs are used to process data, execute instructions, and control the flow of information within a computer system. They are also used to control the input and output of data, as well as to store and retrieve data from memory. CPUs are essential for the functioning of any computer system, and their applications in the computer industry are vast.


DDR5: DDR5 (Double Data Rate 5) is the next generation of memory technology for the computer industry. It is a modern day improvement on earlier DDR technologies, with faster speeds, greater bandwidth and higher capacities. DDR5 enables higher resolution, seamless gaming experiences and faster data transfer rates, making it an ideal choice for high-performance computing and 4K gaming. With its greater RAM compatibility, DDR5 provides faster buffering times and raised clock speeds, giving users an improved overall work system. DDR5 is also optimized for multi-tasking, allowing users to multitask without experiencing a significant drop in performance, increasing the productivity of digital tasks. As an ever-evolving technology, DDR5 is paving the way for the computer industry into a new and powerful era.


HPC: HPC, or High Performance Computing, is a type of technology that allows computers to perform complex calculations and process large amounts of data at incredibly high speeds. This is achieved through the use of specialized hardware and software, such as supercomputers and parallel processing techniques. In the computer industry, HPC has a wide range of applications, from weather forecasting and scientific research to financial modeling and artificial intelligence. It enables researchers and businesses to tackle complex problems and analyze vast amounts of data in a fraction of the time it would take with traditional computing methods. HPC has revolutionized the way we approach data analysis and has opened up new possibilities for innovation and discovery in various fields.


Latency: Technology latency is the time it takes for a computer system to respond to a request. It is an important factor in the performance of computer systems, as it affects the speed and efficiency of data processing. In the computer industry, latency is a major factor in the performance of computer networks, storage systems, and other computer systems. Low latency is essential for applications that require fast response times, such as online gaming, streaming media, and real-time data processing. High latency can cause delays in data processing, resulting in slow response times and poor performance. To reduce latency, computer systems use various techniques such as caching, load balancing, and parallel processing. By reducing latency, computer systems can provide faster response times and improved performance.


Petaflops: Petaflops is a measure of computing speed, specifically one quadrillion floating-point operations per second. This technology is used to measure the performance of supercomputers, which are extremely powerful computers used for complex calculations and simulations. Petaflops technology has revolutionized the computer industry by allowing for faster and more efficient processing of large amounts of data. This has enabled advancements in fields such as weather forecasting, climate modeling, and drug discovery. Petaflops technology has also been utilized in artificial intelligence and machine learning, allowing for more accurate and sophisticated algorithms. In simpler terms, Petaflops is like a race car for computers, allowing them to process information at lightning-fast speeds and tackle complex problems that were previously impossible to solve.


Xeon: The Intel Xeon processor is a powerful and reliable processor used in many computer systems. It is a multi-core processor that is designed to handle multiple tasks simultaneously. It is used in servers, workstations, and high-end desktop computers. It is also used in many embedded systems, such as routers and switches. The Xeon processor is known for its high performance and scalability, making it a popular choice for many computer applications. It is also used in many cloud computing applications, as it is capable of handling large amounts of data and providing high levels of performance. The Xeon processor is also used in many scientific and engineering applications, as it is capable of handling complex calculations and simulations.





Leave a Reply