Google Cloud Unveils 9 New Tools At First Data Cloud Summit
‘We fundamentally want to change…how companies are thinking of data from a technology-centric view to an ability-centric view,’ says Gerrit Kazmaier, Google Cloud’s new general manager and vice president for data analytics, databases and Looker.
Dataplex, now in preview, is an intelligent data fabric that allows organizations to centrally manage, monitor and govern their data across data lakes, data warehouses and databases from a single view to help automate data management.
Enterprise data is and will remain distributed and reside across on premises and one or more cloud environments, but enterprises lack the tools to build an integrated data platform across that footprint, according to Irina Farooq, director of product management for smart analytics at Google Cloud.
“Each of these systems comes with its own way to handle metadata, data quality, security and governance,” she said. “At the same time, the number of analytics and AI (artificial intelligence) users in the enterprise is growing exponentially. Each is looking for self-service access to high-quality data in an ever-growing set of tools of their choice. What this results in is the constant operational overhead of moving data around, duplicating data and creating homegrown processes to track data and users. The problem is that those processes become outdated the moment they’re put in, diminishing users trust in data and undermining key organizational priorities like financial governance and planning.”
Dataplex is built from the ground up for distributed data, requiring no data movement or duplication, according to Farooq. It focuses on three key areas of value: intelligent AI power in data management, centralized security and governance, and integrated analytics experiences that combine the best of Google Cloud Platform (GCP)-native tools such as BigQuery or Dataflow with the best of open-source tools such as Apache Spark.