Created 08-22-2018 11:00 PM
As we all know, files on Hadoop will be copied 3 times to prevent loss. Usually there will be two copies on this rack and one on the other racks. However, when I want to delete a large number of datenode nodes, it is possible to delete the nodes containing 3 copies at the same time.
I have read the help documentation and there are two ways to do it. The link is as follows: https://www.cloudera.com/documentation/enterprise/5-10-x/topics/cm_mc_delete_hosts.html . However, there is no mention of bulk deletion of datenode nodes.
At the same time, I also modified hdfs-site.xml and then refreshed hdfs dfsadmin -refreshNodes, but it has no effect.
<property>
<name>dfs.hosts.exclude</name>
<value>dfshosts.exclude</value>
</property>
Therefore, I would like to ask technical experts, how to operate on the cloudera, can be deleted in batches, while ensuring that data is not lost? Or go directly to the cluster, use a similar configuration file such as hdfs-site.xml or core-site.xml to achieve the purpose?
Created 08-30-2018 02:00 AM
Created 08-23-2018 03:35 AM
Created 08-30-2018 02:00 AM