The 10 Coolest Open-Source Software Tools Of 2022
High-performance data management and analytics tools, software for managing APIs and Kubernetes containers, and application development and machine learning platforms: Here’s a look at the open-source software—both new and tried-and-true—that caught our attention in 2022.
Apache Iceberg is a high-performance format for large-scale data analytics tables, according to a description on the apache.org website. Iceberg brings the reliability and simplicity of SQL tables to big data tasks while making it possible for data processing engines like Spark, Trino, Flink, Presto, Hive and Impala to safely work with the same tables at the same time.
Table formats are a key component of the data architectures that businesses and organizations adopt when implementing data warehouse, data lake and data mesh systems.
Developers at Netflix created Iceberg in 2017 to address challenges the media giant was experiencing with its data warehouse operations, according to the project's Wikipedia entry. It was donated to the Apache Software Foundation in November 2018 and is used today by major companies including Airbnb, Apple, Expedia and Lyft.
Iceberg is released under the Apache License 2.0. A number of big data technology companies including Cloudera, Dremio and Snowflake have embraced Iceberg.