We are pleased to announce general availability of Amazon EC2 P5.4xl instances on SageMaker notebook instances. Amazon EC2 P5.4xl instances are powered by NVIDIA H100 Tensor Core GPUs and deliver high performance in Amazon EC2 for deep learning (DL) and high-performance computing (HPC) applications. They help you accelerate your time to solution by up to 4x compared to previous-generation GPU-based EC2 instances, and reduce the cost to train ML models by up to 40%. Customers can use P5 instances for training and deploying complex large language models (LLMs) and diffusion models powering generative AI applications.
The continuous evolution of AI models requires increasingly powerful and specialized compute infrastructure, driving AWS to integrate the latest NVIDIA H100 GPUs into its SageMaker offering.
This development makes high-performance compute accessible to a broader range of AI developers and researchers, accelerating the training and deployment of advanced models like LLMs and diffusion models.
Customers can now access NVIDIA H100 Tensor Core GPUs on SageMaker notebook instances, significantly reducing the cost and time required to develop and train complex AI models.
- AWS
- NVIDIA
- AI developers and researchers
- Companies developing large language and diffusion models
- Companies relying on older GPU instances
- Less agile cloud AI platform providers
Increased rate of advancement and deployment of complex AI models.
Reinforcement of cloud-based AI development as the standard, reducing barriers to entry for advanced model training.
Further market consolidation around providers offering state-of-the-art AI infrastructure and specialized hardware.
Read at AWS What's New