The 10 Coolest Big Data Tools Of 2021

Rick Whiting

Working with ever-growing volumes of data continues to be a challenge for businesses and organizations. Here are 10 cool big data tools that caught our attention in 2021.

Alluxio Data Orchestration Platform

In November, Alluxio launched version 2.7 of the Alluxio Data Orchestration Platform, the San Mateo, Calif.-based company’s software for managing large-scale distributed data workloads and connecting data sources scattered across hybrid and multi-cloud IT systems.

Alluxio’s software is a virtual distributed file system that separates compute from storage and makes all data appear local no matter where it’s stored. That unifies access to distributed data, providing a way to link business analytics and data-driven applications to distributed data sources and making management of distributed data more efficient.

The new 2.7 release provides a five-fold improvement in I/O performance efficiency for machine learning training by parallelizing data loading, data pre-processing and training pipelines. The new edition also offers enhanced performance insight and support for open table formats like Apache Hudi and Iceberg, allowing the system to scale up access to data lakes for faster Presto- and Spark-based analytics.

 
Rick Whiting

Rick Whiting has been with CRN since 2006 and is currently a feature/special projects editor. Whiting manages a number of CRN’s signature annual editorial projects including Channel Chiefs, Partner Program Guide, Big Data 100, Emerging Vendors, Tech Innovators and Products of the Year. He also covers the Big Data beat for CRN. He can be reached at rwhiting@thechannelcompany.com.
