Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series


December 7, 2023 by our News Team

Supermicro introduces three new AMD-based GPU servers optimized for high-performance and efficiency, featuring the latest AMD Instinct MI300 Series accelerators and offering a range of configurations for AI, LLM, and HPC applications.

  • Optimized for high-performance and efficiency
  • Scalable for complex AI, LLM, and HPC applications
  • Streamlined delivery with worldwide manufacturing facilities


Supermicro, a leading IT solution manufacturer, has launched three new additions to its AMD-based H13 generation of GPU Servers. These servers are optimized to deliver high-performance and efficiency, thanks to the new AMD Instinct MI300 Series accelerators. The new servers, available in both liquid-cooled and air-cooled configurations, are designed to meet the complex demands of AI, LLM, and HPC applications.

The 2U liquid-cooled and 4U air-cooled servers feature the AMD Instinct MI300A Accelerated Processing Units (APUs) accelerators, which improve data center efficiencies. These systems also come with quad APUs for scalable applications. Supermicro is capable of delivering complete liquid-cooled racks for large-scale environments, offering up to 1,728 TFlops of FP64 performance per rack. With its worldwide manufacturing facilities, Supermicro ensures streamlined delivery of these new servers for AI and HPC convergence.

“We are very excited to expand our rack scale Total IT Solutions for AI training with the latest generation of AMD Instinct accelerators, with up to 3.4X the performance improvement compared to previous generations,” said Charles Liang, president and CEO of Supermicro. “Our proven architecture allows 1:1 400G networking dedicated for each GPU designed for large-scale AI and supercomputing clusters capable of fully integrated Liquid Cooling solutions, giving customers a competitive advantage for performance and superior efficiency with ease of deployment.”

One of the new systems, the LLM optimized AS-8125GS-TNMR2, is built on Supermicro’s building block architecture, which is known for high-performance AI systems. This system offers a balanced design that associates a GPU with a 1:1 networking setup, allowing for a large pool of high-bandwidth memory across nodes and racks. It is specifically designed to handle large language models with trillions of parameters, maximizing parallel computing and minimizing training time and inference Latency. The system features the MI300X OAM accelerator, which provides raw acceleration power with AMD Infinity Fabric Links. It also boasts industry-leading 1.5 TB HBM3 GPU memory and native sparse matrix support for AI workloads.

Supermicro is also introducing a density optimized 2U liquid-cooled server, the AS-2145GH-TNMR, and a 4U air-cooled server, the AS-4145GH-TNMR, both equipped with 4 AMD Instinct MI300A accelerators. These servers are designed for HPC and AI applications that require fast CPU to GPU communication. The APUs in these servers combine the highest-performing AMD CPU, GPU, and HBM3 memory on a single chip, eliminating redundant memory copies. Each server includes leadership x86 “Zen 4” CPU cores and 512 GB of HBM3 memory.

Both systems feature dual AIOMs with 400G Ethernet support and expanded networking options to improve space, scalability, and efficiency for high-performance computing. The 2U liquid-cooled system offers excellent total cost of ownership (TCO) with over a 35% energy consumption savings compared to air-cooled systems. It also reduces the number of fans by 70%.

“AMD Instinct MI300 Series accelerators deliver leadership performance, both for longstanding accelerated high-performance computing applications and for the rapidly growing demand for generative AI,” said Forrest Norrod, executive vice president and general manager, Data Center Solutions Business Group, AMD. “We continue to work closely with Supermicro to bring to market leading-edge AI and HPC total solutions based on MI300 Series accelerators and leveraging Supermicro’s expertise in system and data center design.”

Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series

Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series

Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series

Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series

Supermicro Expands AI and GPU Rack Scale Solutions, Now Compatible with AMD Instinct MI300 Series

About Our Team

Our team comprises industry insiders with extensive experience in computers, semiconductors, games, and consumer electronics. With decades of collective experience, we’re committed to delivering timely, accurate, and engaging news content to our readers.

Background Information


About AMD: AMD, a large player in the semiconductor industry is known for its powerful processors and graphic solutions, AMD has consistently pushed the boundaries of performance, efficiency, and user experience. With a customer-centric approach, the company has cultivated a reputation for delivering high-performance solutions that cater to the needs of gamers, professionals, and general users. AMD's Ryzen series of processors have redefined the landscape of desktop and laptop computing, offering impressive multi-core performance and competitive pricing that has challenged the dominance of its competitors. Complementing its processor expertise, AMD's Radeon graphics cards have also earned accolades for their efficiency and exceptional graphical capabilities, making them a favored choice among gamers and content creators. The company's commitment to innovation and technology continues to shape the client computing landscape, providing users with powerful tools to fuel their digital endeavors.

AMD website  AMD LinkedIn

About Supermicro: Supermicro is a reputable American technology company founded in 1993 and headquartered in San Jose, California. Specializing in high-performance server and storage solutions, Supermicro has become a trusted name in the data center industry. The company offers a wide range of innovative and customizable server hardware, including motherboards, servers, storage systems, and networking equipment, catering to the needs of enterprise clients, cloud service providers, and businesses seeking reliable infrastructure solutions.

Supermicro website  Supermicro LinkedIn

Technology Explained


CPU: The Central Processing Unit (CPU) is the brain of a computer, responsible for executing instructions and performing calculations. It is the most important component of a computer system, as it is responsible for controlling all other components. CPUs are used in a wide range of applications, from desktop computers to mobile devices, gaming consoles, and even supercomputers. CPUs are used to process data, execute instructions, and control the flow of information within a computer system. They are also used to control the input and output of data, as well as to store and retrieve data from memory. CPUs are essential for the functioning of any computer system, and their applications in the computer industry are vast.


GPU: GPU stands for Graphics Processing Unit and is a specialized type of processor designed to handle graphics-intensive tasks. It is used in the computer industry to render images, videos, and 3D graphics. GPUs are used in gaming consoles, PCs, and mobile devices to provide a smooth and immersive gaming experience. They are also used in the medical field to create 3D models of organs and tissues, and in the automotive industry to create virtual prototypes of cars. GPUs are also used in the field of artificial intelligence to process large amounts of data and create complex models. GPUs are becoming increasingly important in the computer industry as they are able to process large amounts of data quickly and efficiently.


Latency: Technology latency is the time it takes for a computer system to respond to a request. It is an important factor in the performance of computer systems, as it affects the speed and efficiency of data processing. In the computer industry, latency is a major factor in the performance of computer networks, storage systems, and other computer systems. Low latency is essential for applications that require fast response times, such as online gaming, streaming media, and real-time data processing. High latency can cause delays in data processing, resulting in slow response times and poor performance. To reduce latency, computer systems use various techniques such as caching, load balancing, and parallel processing. By reducing latency, computer systems can provide faster response times and improved performance.


Liquid Cooling: Liquid cooling is a technology used to cool down computer components, such as processors, graphics cards, and other components that generate a lot of heat. It works by circulating a liquid coolant, such as water or a special coolant, through a series of pipes and radiators. The liquid absorbs the heat from the components and then dissipates it into the air. This technology is becoming increasingly popular in the computer industry due to its ability to provide more efficient cooling than traditional air cooling methods. Liquid cooling can also be used to overclock components, allowing them to run at higher speeds than their rated speeds. This technology is becoming increasingly popular in the gaming industry, as it allows gamers to get the most out of their hardware.


LLM: A Large Language Model (LLM) is a highly advanced artificial intelligence system, often based on complex architectures like GPT-3.5, designed to comprehend and produce human-like text on a massive scale. LLMs possess exceptional capabilities in various natural language understanding and generation tasks, including answering questions, generating creative content, and delivering context-aware responses to textual inputs. These models undergo extensive training on vast datasets to grasp the nuances of language, making them invaluable tools for applications like chatbots, content generation, and language translation.





Leave a Reply