Hadoop Performance

We know that Hadoop's main purpose is to increase performance by adding more data nodes.

My question is: if we only want to retrieve the data, without needing to process or analyze it, will adding more data nodes be useful? Or will it not increase performance at all, because we only have retrieve operations, with no computation or MapReduce jobs?

1 ACCEPTED SOLUTION

@oula.alshiekh@gmail.com alshiekh Add DataNodes if you are running out of storage capacity in the cluster. Add compute nodes when you see a bottleneck in processing: with more compute nodes you can launch more MapReduce/Spark tasks. You can also use the same node both to store data and to add processing capacity (in terms of a higher number of MapReduce tasks).
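
To make the rule of thumb above concrete, here is a rough Python sketch. It is only an illustration: the function names, the 80% storage threshold, and the idea of comparing pending tasks against available task slots are assumptions made for this example, not Hadoop settings or APIs.

```python
def max_concurrent_tasks(nodes, slots_per_node):
    """Rough upper bound on tasks that can run at once.

    Assumes every compute node offers the same number of task slots
    (a simplification made for this sketch).
    """
    return nodes * slots_per_node


def scaling_hint(used_tb, capacity_tb, pending_tasks, task_slots,
                 storage_threshold=0.8):
    """Suggest which kind of node to add.

    storage_threshold is a hypothetical cutoff (80% full), not a Hadoop default.
    """
    storage_tight = used_tb / capacity_tb >= storage_threshold
    compute_tight = pending_tasks > task_slots
    if storage_tight and compute_tight:
        return "add storage and compute"
    if storage_tight:
        return "add storage"
    if compute_tight:
        return "add compute"
    return "no change needed"


if __name__ == "__main__":
    slots = max_concurrent_tasks(nodes=5, slots_per_node=8)
    # Cluster is 90% full but tasks are not queuing up.
    print(scaling_hint(used_tb=90, capacity_tb=100,
                       pending_tasks=10, task_slots=slots))
```

In this toy model a nearly full cluster with idle task slots points to more storage, while a half-empty cluster with tasks waiting points to more compute.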


