Cloud News
AWS’ 6 Huge Generative AI Products, Partner Plans: Ruba Borno
Mark Haranas
AWS’ worldwide channel chief, Ruba Borno, tells CRN about the newest generative AI products partners should be taking to market today along with the resources AWS is providing.

New Amazon EC2 Inf2 Makes ‘Generative AI Cost-Efficient’
AWS recently made generally available Amazon EC2 Inf2 instances powered by AWS’ own Inferentia2 chips, which aims to lower the cost of running generative AI workloads.
“We’re making generative AI cost-efficient from a best-in-class infrastructure perspective,” said Borno.
“If you think about generative AI, there’s two types of work being done: one is the inferences, and the other is the training that has to happen. So training the models and then inferring answers from the model,” said Borno. “Amazon EC2 Inferentia2 instances deliver 4X higher throughput and 10X lower latency compared to the prior generation. And 40 percent better inference price performance than any other EC2 instance. So we’re really enabling inference.”
Borno said lowering costs and energy consumption make generative AI more accessible to a wider variety of customers.