We are pleased to announce general availability of Amazon EC2 P5en.48xl instances on SageMaker notebook instances. Amazon EC2 P5en instances feature 8 H200 GPUs which have 1.7x GPU memory size and 1.4x GPU memory bandwidth than H100 GPUs featured in P5 instances. P5en instances pair the H200 GPUs with high performance custom 4 th Generation Intel Xeon Scalable processors, enabling Gen5 PCIe between CPU and GPU which provides up to 4x the bandwidth between CPU and GPU and boosts AI training and inference performance. P5en, with up to 3200 Gbps of third generation of EFA using Nitro v5, shows up
The continuous demand for higher computational power in AI training and inference, especially for large language models and complex AI workloads, drives rapid advancements in GPU and instance technology.
This announcement signifies a significant leap in available compute power for AI development, enabling researchers and businesses to train larger models faster and deploy more sophisticated AI applications at scale.
SageMaker users now have access to industry-leading H200 GPUs with enhanced memory and bandwidth, reducing training times and increasing the complexity of models that can be run efficiently.
- · AWS (Amazon Web Services)
- · AI developers and researchers
- · Companies adopting advanced AI
- · NVIDIA (H200 GPU manufacturer)
- · Smaller cloud providers with less advanced AI infrastructure
- · Companies relying on older GPU generations for AI workloads
Increased performance and efficiency for AI model training and inference on AWS SageMaker.
Accelerated development and deployment of more complex and higher-performing AI applications across various industries.
Further concentration of advanced AI development on leading cloud platforms, potentially widening the gap with those lacking similar infrastructure.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at AWS What's New