About Akarsh

Akarsh · ‎04-05-2022

There are 2 ways. One is directly adding it in hdfs-site.xml OR just triggering the balancer with these parameters like nohup hdfs balancer -Ddfs.balancer.moverThreads=300 -Ddfs.datanode.balance.max.concurrent.moves=20 -Ddfs.datanode.balance.bandwidthPerSec=20480000 -Ddfs.balancer.dispatcherThreads=400 -Ddfs.balancer.max-size-to-move=100737418240 -threshold 10 >/tmp/new_balancer1.out This will run the balancer in non default values and it will finish the balancer operation much more quicker. ** Be aware that the run using above command and parameter will cause high Bandwidth usage and will create lot of i/o storms. For more details on the parameters mentioned above please refer below doc https://hadoop.apache.org/docs/r2.9.0/hadoop-project-dist/hadoop-hdfs/hdfs-default.xml

Akarsh · ‎04-05-2022

Hello, The error is due to the exhausted thread quota on the DN side. Usually this can be controlled using the balancer parameters. Kindly refer https://docs.cloudera.com/HDPDocuments/HDP3/HDP-3.0.1/data-storage/content/properties_for_configuring_the_balancer.html Ideally changing the values for "dfs.datanode.balance.max.concurrent.moves" should help come out of the issue. n/w bandwidth can become an issue while we are dealing with large volume of data movement but according to this error it's on the quota.

Akarsh · ‎04-04-2022

Hello If the query is resolved can you kindly mark this as done?

Akarsh · ‎01-06-2022

First of validate in zookeeper if there are entries for the hbase id. There is another easy way to wipe the slate clean bin/hbase clean Select the options -cleanAll which will delete HDFS data and also the zookeeper data. This should clean the things and get the things going. ** Make sure to stop the Hbase service when you are doing this. OR You can use -cleanZk option to delete only the zookeeper data and re populate the same. Steps remain the same, bring down the Hbase service and run these commands from admin/master nodes. **These actions can't be reverted.

Akarsh · ‎09-17-2021

Hello If you have unbalanced disks in cluster please use the interdisk balancer. So usually it would be in RoundRobin fashion and since few disks are smaller when compared to other we are running into issues. Please refer below doc: https://blog.cloudera.com/how-to-use-the-new-hdfs-intra-datanode-disk-balancer-in-apache-hadoop/ We can use the parameter for available space. Usually the HDFS balancer uses DataNode balance by specified %. So it considers the overall usage of the DataNode rather than the individual disks on the DataNode.

Akarsh · ‎11-29-2020

When we restart the JournalNode Quorum the epoch number will change. We usually see that the errors when the JournalNodes are not in sync. Check for the writer epoch on current dir for JournalNode process, which one of the JournalNodes is lacking we can manually copy the files from working JournalNode and it will pick up. This should happen automatically when we restart the JournalNodes, if not then above is the procedure.

Online	Offline
Last Visited	‎11-28-2022 02:15 PM

Member Since	‎01-07-2020 04:53 AM
Last Visited	‎11-28-2022 02:15 PM
Posts	36

Cloudera Community

Re: URGENT case: Failed to place enough replicas, ...

Re: Rebalance HDFS - Not able to copy block 131421...

Re: Rebalance HDFS - Not able to copy block 131421...

Re: URGENT case: Failed to place enough replicas, ...

Re: Cannot restore hbase.id and hbase.version

Re: How HDFS balancer works ?

Re: How QJM Works in Namenode HA