Support Questions

oula_alshiekh · ‎12-08-2016

We know that Hadoop's main purpose is to increase performance by adding more data nodes.

My question is: if we only want to retrieve the data, without processing or analyzing it, will adding more data nodes help? Or will it not improve performance at all, because we only have retrieve operations without any computation or MapReduce jobs?

rajkumar_singh · ‎12-08-2016

@oula.alshiekh@gmail.com alshiekh Add DataNodes if you are running out of storage capacity in the cluster, and add compute nodes when you see a bottleneck in processing. With more compute nodes you can launch more MapReduce/Spark tasks. A node can also serve both roles: storing data and adding processing capacity (in terms of more MapReduce tasks).
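As a rough illustration of that rule of thumb, here is a minimal Python sketch. Everything in it is hypothetical: the `ClusterStats` fields, the `scaling_advice` function, and the threshold values are assumptions made for the example, not Hadoop APIs or recommended settings. In practice you would take the numbers from your own cluster monitoring.

```python
from dataclasses import dataclass


@dataclass
class ClusterStats:
    # Hypothetical inputs; read these from your own monitoring.
    dfs_used_fraction: float  # share of HDFS capacity in use, 0.0 to 1.0
    pending_tasks: int        # tasks waiting for a free container


def scaling_advice(stats, storage_threshold=0.8, pending_threshold=0):
    """Return a list of suggestions based on the two bottlenecks above.

    The default thresholds are illustrative assumptions only.
    """
    advice = []
    # Running out of storage capacity: add DataNodes.
    if stats.dfs_used_fraction >= storage_threshold:
        advice.append("add DataNodes (storage)")
    # Tasks are queuing up: add compute nodes.
    if stats.pending_tasks > pending_threshold:
        advice.append("add compute nodes (processing)")
    return advice or ["no change needed"]


if __name__ == "__main__":
    print(scaling_advice(ClusterStats(dfs_used_fraction=0.9, pending_tasks=12)))
```

If both conditions hold, a node that stores data and also runs tasks covers both suggestions at once.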

Cloudera Community

Hadoop Performance