Advertisement

Cloud News

AWS’ 6 Huge Generative AI Products, Partner Plans: Ruba Borno

Mark Haranas

AWS’ worldwide channel chief, Ruba Borno, tells CRN about the newest generative AI products partners should be taking to market today along with the resources AWS is providing.

New Amazon EC2 Inf2 Makes ‘Generative AI Cost-Efficient’

AWS recently made generally available Amazon EC2 Inf2 instances powered by AWS’ own Inferentia2 chips, which aims to lower the cost of running generative AI workloads.

“We’re making generative AI cost-efficient from a best-in-class infrastructure perspective,” said Borno.

“If you think about generative AI, there’s two types of work being done: one is the inferences, and the other is the training that has to happen. So training the models and then inferring answers from the model,” said Borno. “Amazon EC2 Inferentia2 instances deliver 4X higher throughput and 10X lower latency compared to the prior generation. And 40 percent better inference price performance than any other EC2 instance. So we’re really enabling inference.”

Borno said lowering costs and energy consumption make generative AI more accessible to a wider variety of customers.

 

 
Mark Haranas

Mark Haranas is an assistant news editor and longtime journalist now covering cloud, multicloud, software, SaaS and channel partners at CRN. He speaks with world-renown CEOs and IT experts as well as covering breaking news and live events while also managing several CRN reporters. He can be reached at mharanas@thechannelcompany.com.

Advertisement
Advertisement
Sponsored Post
Advertisement

NEWSLETTER

Advertisement