Advertisement

Cloud News

Why Google Cloud’s Data Analytics Tops The Competition: Debanjan Saha

Donna Goodison

‘We have been in the business of organizing the world‘s information for a very long time,’ said Debanjan Saha, vice president and general manager of data analytics at Google Cloud. ‘Data is in our DNA, and that’s what drives Google.’

The Data Cloud

The No. 1 differentiator for Google Cloud, Saha said, is that it has the “most complete portfolio of data analytics products in the market.”

That’s a combination of its cloud-native products that have been built over time -- initially for Google’s internal use – its managed open-source products such as Dataproc and Cloud Data Fusion, and partner products including Databricks on Google Cloud, Apache Kafka as a service with Confluent Cloud on Google Cloud Platform (GCP) and Elastic on Google Cloud.

Dataproc is a managed Apache Spark and Apache Hadoop service that allows users to take advantage of open-source data tools for batch processing, querying, streaming and machine learning, while Cloud Data Fusion is a fully managed, cloud-native, enterprise data integration service for quickly building and managing data pipelines. 

“The reason for having all of these services is to give optionality and choice to our customers, but what makes it special is that we make it a platform,” Saha said. “Think of it like a data cloud -- that these are not individual products, but these are all tied together, so that people can do everything they want: collecting data, preparing them for analysis, doing the analytics on them, presenting them, presenting the insight in dashboards and many other formats, doing predictions with AI/ML.”

 
Advertisement
Advertisement
Sponsored Post
Advertisement
Advertisement