Good day, We're planning to deploy a 5-nodes (1 for namenode, 4 for datanodes) CDH cluster (managed by Cloudera Manager) for internal development and experimentations. Can anyone suggests what is the recommended Hardware requirements (especially OS Disk capacity for both NameNode and DataNodes) that we can follow for our environment? We already found this blog ( https://blog.cloudera.com/blog/2013/08/how-to-select-the-right-hardware-for-your-new-hadoop-cluster/ ) but we find it's too much for our experimental cluster. Thanks, Reijay
... View more
Hi, we have a virtualize environment (for testing purposes only) and planning to install Cloudera on 4 vm's. So why is the firewall open? It's because we need to minimize the network traffic since we are using NAT in order to access the internet (we only have 1 public IP). We configure some networking stuff. We encounter the error of port limits (which is not related to cloudera) using this method, thus we have no choice but to limit the ports that are open and thats why we come up on using the firewall or turning it on while installation. Now, we have seen the list of the ports that cloudera uses and it is too many if we need to register it on firewall manually. Is there any scripts or tools to be used in order to perform this with an ease ??
... View more