AWS and NVIDIA join forces to push boundaries of Generative AI Innovation.


March 19, 2024 by our News Team

Amazon Web Services (AWS) and NVIDIA have announced a partnership to bring the new NVIDIA Blackwell GPU platform to AWS, aiming to provide advanced infrastructure, software, and services for AI capabilities.

  • The collaboration between AWS and NVIDIA brings together two industry leaders in cloud computing and AI, providing customers with the most advanced infrastructure and services for their AI workloads.
  • The NVIDIA Blackwell platform, featuring the GB200 Grace Blackwell Superchip and B100 Tensor Core GPUs, will be available on AWS, allowing customers to scale their AI workloads to thousands of GB200 Superchips.
  • The collaboration addresses security concerns by utilizing AWS's Nitro System, AWS KMS, EFA, and Nitro Enclaves, along with NVIDIA's GB200 encryption capabilities, to ensure secure handling of data throughout the training workflow.


Amazon Web Services (AWS) and nVidia have announced their collaboration to bring the new NVIDIA Blackwell GPU platform to AWS. This partnership aims to provide customers with the most advanced infrastructure, software, and services to unlock new generative artificial intelligence (AI) capabilities. By combining NVIDIA’s latest multi-node systems with AWS’s powerful networking and advanced security features, customers can build and run real-time inference on multi-trillion parameter large language models (LLMs) faster and at a lower cost.

The NVIDIA Blackwell platform, featuring the GB200 Grace Blackwell Superchip and B100 Tensor Core GPUs, will be available on AWS. With AWS’s Nitro System, AWS Key Management Service (AWS KMS), Elastic Fabric Adapter (EFA), and Amazon Elastic Compute Cloud (Amazon EC2) UltraCluster hyper-scale clustering, customers can scale their AI workloads to thousands of GB200 Superchips. This collaboration enables a significant improvement in speeding up inference workloads for resource-intensive language models.

In addition to offering the NVIDIA Blackwell platform on AWS, plans are underway to deploy EC2 instances featuring the new B100 GPUs in EC2 UltraClusters for accelerating generative AI training and inference at massive scale. Furthermore, NVIDIA DGX Cloud, an AI platform co-engineered on AWS, will provide enterprise developers with dedicated access to infrastructure and software for building and deploying advanced generative AI models.

Security is a top priority when implementing AI, and this collaboration addresses that concern. The combination of the AWS Nitro System and the NVIDIA GB200 ensures that model weights are securely handled throughout the training workflow. The GB200 encrypts data transfer and allows physical encryption of connections between GPUs, while EFA encrypts data across servers for distributed training and inference. Additionally, AWS Nitro Enclaves and AWS KMS enable customers to create a trusted execution environment for their EC2 instances, providing unparalleled control over their data.

Project Ceiba, a collaboration between NVIDIA and AWS, is set to build one of the world’s fastest AI supercomputers exclusively hosted on AWS. This supercomputer, powered by the new NVIDIA GB200 NVL72, will advance AI research and development in various fields such as LLMs, graphics, simulation, robotics, and climate prediction.

The collaboration between AWS and NVIDIA extends to healthcare and life sciences. Together, they offer high-performance, low-cost inference for generative AI with Amazon SageMaker integration with NVIDIA NIM inference microservices. This integration reduces the time-to-market for generative AI applications. Additionally, AWS HealthOmics and NVIDIA Healthcare teams are working together to launch generative AI microservices for drug discovery and digital health, providing healthcare enterprises with GPU-accelerated cloud endpoints for biology, chemistry, imaging, and healthcare data.

Overall, this collaboration between AWS and NVIDIA brings together technologies to accelerate the development of generative AI applications and advance use cases across various industries.

AWS and NVIDIA join forces to push boundaries of Generative AI Innovation.

About Our Team

Our team comprises industry insiders with extensive experience in computers, semiconductors, games, and consumer electronics. With decades of collective experience, we’re committed to delivering timely, accurate, and engaging news content to our readers.

Background Information


About nVidia:

NVIDIA has firmly established itself as a leader in the realm of client computing, continuously pushing the boundaries of innovation in graphics and AI technologies. With a deep commitment to enhancing user experiences, NVIDIA's client computing business focuses on delivering solutions that power everything from gaming and creative workloads to enterprise applications. for its GeForce graphics cards, the company has redefined high-performance gaming, setting industry standards for realistic visuals, fluid frame rates, and immersive experiences. Complementing its gaming expertise, NVIDIA's Quadro and NVIDIA RTX graphics cards cater to professionals in design, content creation, and scientific fields, enabling real-time ray tracing and AI-driven workflows that elevate productivity and creativity to unprecedented heights. By seamlessly integrating graphics, AI, and software, NVIDIA continues to shape the landscape of client computing, fostering innovation and immersive interactions in a rapidly evolving digital world.

nVidia website  nVidia LinkedIn
Latest Articles about nVidia

Technology Explained


AWS: Amazon Web Services (AWS) is a cloud platform powered by Amazon that enables users to access cloud computing services, such as storage, data analytics, and distributed computing. It offers users the ability to utilize both on-demand and pay-as-you-go computing services, making it a great option for the computer industry. It offers a wide range of services with great flexibility for a variety of uses. It can help companies build powerful web and mobile applications, run large-scale analytics, quickly provision servers and other services, design sophisticated architectures for data storage, and more. AWS provides access to a wide range of services such as virtualization, storage, database, monitoring, analytics, and other services that can help organizations increase agility, manage complexity, and remain on the cutting edge of technology. Many big and famous organizations use AWS services to give them a competitive edge, and more and more companies are turning to this service for their computer needs.

Latest Articles about AWS

EC2: Amazon EC2 (Elastic Compute Cloud) is a cloud service provided by Amazon Web Services (AWS). It is a virtual computing environment that allows users to rent or lease an online server, compute power, storage, and other computing resources. EC2 is a highly reliable, cost-effective, easily scalable, and quickly available cloud computing service that allows users to deploy and configure their own computing resources. It has helped businesses around the world to quickly and securely scale their operations, while minimizing IT costs, enabling them to spin up virtual servers in minutes without having to worry about provisioning, maintaining, or managing hardware. Its ease of use and reliable performance has made it an attractive choice for businesses that require a fast, seamless computing solution. EC2 can be used for a wide range of applications, from big data analysis, precise medical imaging, machine learning, or web and mobile app development to 3D rendering, simulation, and gaming.

Latest Articles about EC2

GPU: GPU stands for Graphics Processing Unit and is a specialized type of processor designed to handle graphics-intensive tasks. It is used in the computer industry to render images, videos, and 3D graphics. GPUs are used in gaming consoles, PCs, and mobile devices to provide a smooth and immersive gaming experience. They are also used in the medical field to create 3D models of organs and tissues, and in the automotive industry to create virtual prototypes of cars. GPUs are also used in the field of artificial intelligence to process large amounts of data and create complex models. GPUs are becoming increasingly important in the computer industry as they are able to process large amounts of data quickly and efficiently.

Latest Articles about GPU

Grace Blackwell: Grace Blackwell is a cutting-edge technology that has revolutionized the computer industry. It is a type of artificial intelligence that is designed to mimic human cognitive abilities, such as learning, problem-solving, and decision-making. This technology has been applied in various areas of the computer industry, including data analysis, natural language processing, and machine learning. For example, Grace Blackwell can analyze large amounts of data and identify patterns and trends, making it a valuable tool for businesses to make informed decisions. It can also understand and respond to human language, making it useful for virtual assistants and chatbots. Additionally, Grace Blackwell can continuously learn and improve its performance, making it an invaluable asset in the development of new technologies. Overall, Grace Blackwell has greatly enhanced the capabilities of computers and has opened up new possibilities for the future of technology.

Latest Articles about Grace Blackwell




Leave a Reply