At this time, we’re saying the final availability of Amazon Elastic Compute Cloud (Amazon EC2) G7 situations, delivering excessive efficiency GPU acceleration for AI inference, graphics, and information analytics workloads.
AWS is the primary main cloud supplier to assist NVIDIA RTX PRO 4500 Blackwell Server Version GPUs. G7 situations are accelerated by these GPUs with customized sixth-generation Intel Xeon Scalable processors, delivering as much as 4.6x AI inference efficiency and as much as 2.1x graphics efficiency in comparison with G6 situations. G7 situations additionally ship sooner efficiency for GPU-accelerated analytics on Amazon EMR on Amazon Elastic Kubernetes Service (Amazon EKS). G7 situations are effectively suited to a broad vary of GPU-enabled workloads together with AI inference, graphics rendering, video transcoding and analytics, spatial computing, digital desktop infrastructure (VDI), and information analytics.
Listed below are enhancements of G7 situations in comparison with earlier technology:
- Sooner GPU reminiscence – NVIDIA RTX PRO 4500 Blackwell Server Version GPUs provide 1.33 occasions the GPU reminiscence capability and a couple of.45 occasions the GPU reminiscence bandwidth in comparison with G6 situations. With 32 GB of GPU reminiscence per GPU, fifth Gen Tensor Cores, and 4th Gen RT Cores, G7 situations ship enhanced AI inference and graphics efficiency.
- Excessive efficiency networking and storage – G7 situations include 700 Gbps of EFA-enabled networking throughput (7x in comparison with G6) enabling the low-latency, high-bandwidth connectivity that AI inference, graphics-intensive functions, and GPU-accelerated information analytics workloads must carry out at their greatest. G7 situations assist as much as 7.6 TB native NVMe SSD storage, enabling you to maintain giant fashions and datasets near compute, scale back information switch overhead, and enhance throughput.
- Superior video encoding and decoding engines – Ninth-generation NVENC and sixth-generation NVDEC engines assist 4:2:2 encoding and decoding for high-resolution video workflows, delivering 1.5x concurrent video streams in comparison with previous-generation G6 situations.
EC2 G7 occasion specs
G7 situations function as much as 8 NVIDIA RTX PRO 4500 Blackwell Server Version GPUs with as much as 256 GB of complete GPU reminiscence (32 GB of reminiscence per GPU) and customized Intel Xeon Scalable processors. In addition they can be found in 7 sizes and assist as much as 192 vCPUs, as much as 700 Gbps of community bandwidth, as much as 768 GiB of system reminiscence, and as much as 7.6 TB of native NVMe SSD storage.
Listed below are the specs:
| Occasion title | GPUs | GPU reminiscence (GB) | vCPUs | Reminiscence (GiB) | Storage | EBS bandwidth (Gbps) | Community bandwidth (Gbps) |
| g7.2xlarge | 1 | 32 | 8 | 32 | 1 x 600 | As much as 8 | As much as 60 |
| g7.4xlarge | 1 | 32 | 16 | 64 | 1 x 600 | 8 | As much as 100 |
| g7.8xlarge | 1 | 32 | 32 | 128 | 1 x 950 | 16 | As much as 100 |
| g7.12xlarge | 2 | 64 | 48 | 192 | 1 x 1900 | 20 | 175 |
| g7.24xlarge | 4 | 128 | 96 | 384 | 1 x 3800 | 40 | 350 |
| g7.48xlarge | 8 | 256 | 192 | 768 | 2 x 3800 | 80 | 700 |
| g7.steel* | 8 | 256 | 192 | 768 | 2 x 3800 | 80 | 700 |
* Coming quickly
G7 situations assist NVIDIA GPUDirect P2P for multi-GPU sizes, NVIDIA GPUDirect RDMA with EFA, and GPUDirect RDMA with EFA for Amazon FSx for Lustre, enabling low-latency GPU-to-GPU communication for multi-GPU and multi-node workloads.
To get began with G7 situations, you should utilize the AWS Deep Studying AMIs (DLAMI) or NVIDIA Workstation AMIs with prepackaged GPU drivers on your AI inference and graphics workloads. To make use of G7 situations with Amazon EKS, construct EKS AMIs with NVIDIA driver model R595 with EKS-provided automation. G7 situations assist a number of working techniques together with Amazon Linux, Ubuntu, RHEL, and Home windows Server, with complete NVIDIA driver integration offering compatibility with industry-standard graphics libraries together with DirectX, Vulkan, and OpenGL.
Get began at present
You can begin utilizing Amazon EC2 G7 situations at present in two AWS areas: US East (Ohio) and US West (Oregon). To examine future Regional enlargement plans, search for the occasion kind within the CloudFormation assets tab on the AWS Capabilities by Area web page.
G7 situations are supplied by way of a number of buying choices, together with On-Demand, Financial savings Plans, and Spot Situations. Devoted Situations are additionally supported for the 12xlarge, 24xlarge, and 48xlarge sizes. For detailed pricing, go to the Amazon EC2 Pricing web page.
Able to get began? Launch G7 situations from the Amazon EC2 console. For extra particulars, head over to the Amazon EC2 G7 situations web page. We’d love to listen to your suggestions. Share it on AWS re:Submit for EC2 or attain out by way of your normal AWS Help contacts.
– Daniel Abib

