Nutanix’s New Nvidia Agentic AI Platform For GPUs And AI Factories Unveiled

Nutanix builds a new Agentic AI software stack that integrates with Nvidia to drive GPU efficiency and agentic innovation inside Nvidia-certified AI factories.

Nutanix launched its new Agentic AI software stack with deep integration with Nvidia that offers a cloud operating model to drive Nvidia GPU efficiency and agentic AI transformation for customers.

The new Nutanix Agentic AI offering integrates Nvidia AI Enterprise at the Agent Builder layer and can orchestrate Nvidia’s ecosystem of AI factories, while also letting teams instantly deploy Nvidia NIMs—including Nemotron—to accelerate the development of AI applications.

“By treating AI infrastructure as a centralized, shared, highly flexible resource—rather than each project as a silo—we are bringing the cloud operating model to the AI factory,” said Thomas Cornely, executive vice president of product management at Nutanix, in a blog post.

[Related: Nutanix, VMware Heads On Memory And CPU Shortage Impacts]

Nutanix Agentic AI extends the San Jose, Calif.-based company’s AHV hypervisor, Flow Virtual Networking, Nutanix Kubernetes Platform and Nutanix Enterprise AI to enable customers to build, operate and govern AI factories while providing agentic AI developers with models and platform services.

“This architecture doesn’t just simplify operations,” Cornely said. “It fundamentally optimizes AI economics by minimizing the cost per token across dynamic, multi-user environments.”

The new offering automatically optimizes workload placement across GPU-dense servers for maximum performance without the need for manual tuning.

Before jumping into more details, it’s key to know that Nutanix recently reported second fiscal quarter sales of $723 million, up 10 percent year over year.

The company now has a $2.9 billion annual run rate.

Here’s what you need to know about the new Nutanix Agentic AI offering with Nvidia.

Designed For Lower Token Costs

The Nutanix Agentic AI offering delivers optimized performance and security and is designed to enable lower, predictable token costs by providing things like an advanced AI Gateway and Model-as-a-Service.

It includes the newly released Nutanix Enterprise AI version 2.6 that includes the AI Gateway service for unified policy control over cloud-hosted and private LLMs, as well as new support for the Model Context Protocol (MCP) server and fine-tuning.

In addition, it includes the Nutanix Kubernetes Platform with a catalog of prebuilt open- source AI developer tools including Notebooks, Vector Databases, MLOps workflow engines and Agentic frameworks.

Nvidia GPU Efficiency And NIM Microservices

The new Agentic AI offering also includes Nutanix Unified Storage that delivers linearly scalable read/write performance for thousands of GPU clients.

Nutanix said by providing a high-capacity tier for KV Cache offloading and support for S3 over RDMA and NFS over RDMA, it provides a scalable, low-latency data fabric that maximizes GPU efficiency across all enterprise AI workloads.

The Nutanix Flow Virtual Networking offering has been enhanced to offload the network dataplane to Nvidia BlueField, delivering high-performance networking while reducing host CPU and memory consumption.

Developers can deploy Nvidia NIM microservices on the new Nutanix platform to accelerate the development of high-performance AI applications in production.

“These enhanced capabilities bring all the benefits of virtual machines for workload and tenant isolation, day 2 operations, and infrastructure resilience to Agentic AI workloads with maximum performance, security and resource utilization to help achieve lower cost per token,” said Cornely.

Accenture: New Nutanix Offering Enables ‘Agentic AI At Scale’

In an email to CRN, Accenture said the new Nutanix Agentic AI solution will help “reinvent” operations and enable “agentic AI at scale.”

“The capabilities that Nutanix will offer in their Agentic AI solution can help organizations build an agentic architecture to create an integrated AI interface across data and platforms and reimagine how processes are structured for more unified and efficient operations,” said Accenture’s Dave Malik, global infrastructure engineering AI and high-performance computing lead.

Customers can deploy AI factories on hardware from Cisco Systems, Dell Technologies, Lenovo and Supermicro, supported with validation by Nutanix and Nvidia.

“The future belongs to the Agentic Enterprise. By orchestrating the interaction between GPU compute, software-defined networking and enterprise data services, Nutanix is doing more than just racking and stacking hardware,” said Nutanix’s Cornely.

“We are providing a running and regulated environment that turns AI complexity into a scalable competitive edge,” he said.