Support Questions

Find answers, ask questions, and share your expertise

Hadoop Performance

avatar
Rising Star

We know that hadoop main purpose is increasing Performance through adding more data nodes

but my question is

if we want to retrieve the data only with out the need to process it or

analyze it )

is adding more data nodes will be useful or it doesn't increase the performance at all because we have retrieve operations only with out any computations or map reduce jobs

1 ACCEPTED SOLUTION

avatar
Super Guru

@oula.alshiekh@gmail.com alshiekhadd datanode if you are running out of storage capacity of cluster, add computation node when you see bottleneck in processing, by adding more computation nodes you can launch more mapreduce/spark task. you can also use your node to store data as well as to add more processing capacity(in terms of more no mapreduce tasks)

View solution in original post

1 REPLY 1

avatar
Super Guru

@oula.alshiekh@gmail.com alshiekhadd datanode if you are running out of storage capacity of cluster, add computation node when you see bottleneck in processing, by adding more computation nodes you can launch more mapreduce/spark task. you can also use your node to store data as well as to add more processing capacity(in terms of more no mapreduce tasks)