Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

What is the difference between Google Dataflow and Hadoop Data Flow

Solved Go to solution
Highlighted

What is the difference between Google Dataflow and Hadoop Data Flow

Can someone please help me understand the difference between Google Cloud DataFlow and Hortonworks DataFlow. Are there technical differences I should be aware of?

1 ACCEPTED SOLUTION

Accepted Solutions

Re: What is the difference between Google Dataflow and Hadoop Data Flow

Explorer

Google Cloud Dataflow is a service which replaces MapReduce processing, and is designed strictly for the Google Compute Cloud. Whereas Hortonworks Dataflow is a product aiming to solve data flow problems, even outside of data center.

So the answer is no, they are essentially using similar names to describe very different things. One is sitting in the cloud waiting for data to be delivered to it; and the other one delivers data to all kinds of processing systems: Google Dataflow, Storm, Spark, etc.

View solution in original post

3 REPLIES 3

Re: What is the difference between Google Dataflow and Hadoop Data Flow

Explorer

Google Cloud Dataflow is a service which replaces MapReduce processing, and is designed strictly for the Google Compute Cloud. Whereas Hortonworks Dataflow is a product aiming to solve data flow problems, even outside of data center.

So the answer is no, they are essentially using similar names to describe very different things. One is sitting in the cloud waiting for data to be delivered to it; and the other one delivers data to all kinds of processing systems: Google Dataflow, Storm, Spark, etc.

View solution in original post

Highlighted

Re: What is the difference between Google Dataflow and Hadoop Data Flow

Mentor

Google Dataflow is a language framework for multiple engines like Spark, Flink and mapreduce. Hadoop Data Flow is a data in motion processing tool with a visual editor.

Highlighted

Re: What is the difference between Google Dataflow and Hadoop Data Flow

Expert Contributor
Don't have an account?
Coming from Hortonworks? Activate your account here