Components & Peripherals News
8 Big Announcements At Nvidia’s GTC 2023: From Generative AI Services to New GPUs
At Nvidia’s GTC 2023 event, the chip designer revealed new cloud services meant to help enterprises build generative AI and metaverse applications as well as new GPUs, systems and other components. CRN outlines the biggest announcements that will open new opportunities for partners.
New Specialized Data Center GPUs For AI, Graphics
Nvidia expanded its portfolio of data center GPUs with products that are specialized for AI-powered video performance and large language models.
Designed for AI-powered video applications, Nvidia is pitching the L4 GPU as much more faster and energy efficient than CPUs for those workloads. Compared to a dual-socket server with two Intel Xeon Platinum 8380s, from 2021’s third generation of Xeon Scalable processors, a server with eight L4 GPUs is 120 faster and 99 percent more energy efficient, according to Nvidia. The GPU is built for video decoding and transcoding, video streaming, augmented reality, AI-generated video and other video workloads.
Google Cloud has launched a private preview of L4-powered instances with the G2 virtual machines. The GPU is also available in systems from more than 30 vendors, including ASUS, Atos, Cisco Systems, Dell Technologies, Gigabyte, Hewlett Packard Enterprise, Lenovo and Supermicro.
The company also revealed the H100 NVL, which combines two H100 PCIe cards and connects them with an NVlink bridge. Designed to running inference on massive large language models like the popular ChatGPT at scale, the GPU comes with 94GB of memory, thanks to Hopper’s Transformer Engine, provides up to 12 times faster inference performance for the GPT-3 model compared to the A100.
The H100 NVL is expected to launch in the second half of the year.