We are pleased to announce general availability of Amazon EC2 G6e instances on SageMaker notebook instances. Amazon EC2 G6e instances are powered by up to 8 NVIDIA L40s Tensor Core GPUs with 48 GB of memory per GPU and third generation AMD EPYC processors. G6e instances deliver up to 2.5x better performance compared to EC2 G5 instances. Customers can use G6e instances to interactively test model deployment and for interactive model training use cases such as generative AI fine-tuning. You can use G6e instances to deploy large language models (LLMs) with up to 13B parameters and diffusion model
The continuous evolution of AI models, particularly generative AI and LLMs, demands increasingly powerful and efficient computing infrastructure, leading to rapid hardware updates.
This development allows for more performant and cost-effective interactive development and deployment of advanced AI, directly impacting the speed and scale of innovation in generative AI and LLMs.
Developers now have access to significantly more powerful GPU instances for AI model training and deployment within SageMaker, improving iteration speed and model capacity.
- · AWS
- · NVIDIA
- · AI/ML developers
- · Generative AI startups
- · Lesser performing cloud instance providers
- · Companies with older AI infrastructure
Increased adoption and accelerated development of large language models and diffusion models on AWS.
Reduced barriers to entry for deploying complex AI models, leading to a proliferation of more sophisticated AI applications.
Intensified AI compute arms race among cloud providers, further centralizing AI development on large platforms.
This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.
Read at AWS What's New