Member since: 07-30-2019
Posts: 111
Kudos Received: 185
Solutions: 35
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 3072 | 02-07-2018 07:12 PM
 | 2311 | 10-27-2017 06:16 PM
 | 2628 | 10-13-2017 10:30 PM
 | 4823 | 10-12-2017 10:09 PM
 | 1215 | 06-29-2017 10:19 PM
04-24-2017
08:13 PM
1 Kudo
Hi @Michael Häusler, this may be caused by HDFS-9958. HDFS-9958 is not fixed in HDP 2.4.2, but it is fixed in HDP 2.4.3. If you see this consistently, I'd recommend upgrading to check whether that fixes the problem. If you have a support contract, we can provide you with a hotfix release.
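As a quick sanity check before deciding to upgrade (not from the original thread), you can confirm which Hadoop build the cluster is actually running:

```bash
# Print the Hadoop build/version this cluster is running
hdfs version
```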
04-23-2017
08:43 PM
Check your rack setting for the DataNode. If you don't spot the problem, post the output of the following command and someone may be able to point out the error:

hdfs dfsadmin -report
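If you only want to verify the rack mapping itself, a more compact view (assuming you have HDFS admin privileges) is:

```bash
# Print the cluster topology: each rack and the DataNodes mapped to it
hdfs dfsadmin -printTopology
```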
04-18-2017
02:46 PM
Hi @Sedat Kestepe, take a look at rack awareness: https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-common/RackAwareness.html Here's how you can configure racks using Ambari: https://docs.hortonworks.com/HDPDocuments/Ambari-2.2.0.0/bk_Ambari_Users_Guide/content/ch03s11.html HDFS avoids placing all replicas of a block in the same rack, so a single rack failure cannot cause data loss. You may be able to use this to achieve what you want.
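For reference, a minimal rack topology script might look like the sketch below. It is illustrative only: the subnets and rack names are made up, and the script is the one you would reference from net.topology.script.file.name in core-site.xml. Hadoop invokes it with one or more IPs/hostnames as arguments and expects one rack path per input, one per line, on stdout:

```bash
#!/bin/bash
# Illustrative rack topology script (map your own hosts/subnets to racks).
for host in "$@"; do
  case "$host" in
    10.1.1.*) echo "/rack1" ;;
    10.1.2.*) echo "/rack2" ;;
    *)        echo "/default-rack" ;;
  esac
done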
04-06-2017
04:33 PM
1 Kudo
Using RAID can reduce the availability and fault tolerance of HDFS, and it reduces overall performance compared to JBOD. We strongly recommend configuring your disks as JBOD: HDFS already stores data redundantly by replicating across nodes and racks, and it can automatically recover from disk and node failures.
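Concretely, a JBOD layout means each physical disk gets its own mount point and its own entry in dfs.datanode.data.dir (hdfs-site.xml). The paths below are illustrative, not from the original thread:

```bash
# Example dfs.datanode.data.dir value, one entry per physical disk:
#   /grid/0/hadoop/hdfs/data,/grid/1/hadoop/hdfs/data,/grid/2/hadoop/hdfs/data
# Verify each data dir sits on a separate physical disk, not a RAID volume:
df -h /grid/0 /grid/1 /grid/2
```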
04-05-2017
01:46 PM
I'd also post this question on the Ambari track to check why Ambari didn't detect the DataNodes going down. From your logs it is hard to say why the DataNode went down; I again recommend increasing the DataNode heap allocation via Ambari, and check that your nodes are provisioned with a sufficient amount of RAM.
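For reference, the Ambari setting corresponds to the DataNode JVM options in hadoop-env.sh; a sketch under the assumption that you manage hadoop-env.sh directly (the 4 GB value is illustrative, size it to your workload):

```bash
# hadoop-env.sh: the setting behind Ambari's DataNode Java heap size
export HADOOP_DATANODE_OPTS="-Xms4096m -Xmx4096m ${HADOOP_DATANODE_OPTS}"

# Confirm the node actually has enough physical memory for the heap you set
free -g
```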
04-05-2017
01:41 PM
OK, it looks like you have automatic failover enabled. I am not sure why you are getting the EOFException; look through your NameNode logs to see if there are any errors.
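You can also check which NameNode is currently active; nn1/nn2 below are illustrative IDs, substitute the ones from dfs.ha.namenodes.&lt;nameservice&gt; in your hdfs-site.xml:

```bash
# Report the HA state (active/standby) of each NameNode
hdfs haadmin -getServiceState nn1
hdfs haadmin -getServiceState nn2
```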
04-05-2017
01:36 PM
The Mover will move blocks within the same node when possible, and thus tries to avoid network activity. If that is not possible (e.g., when a node doesn't have an SSD or when the local SSDs are full), it will move block replicas across the network to another node that has the target media. I've edited my answer.
04-04-2017
09:06 PM
@Riccardo Iacomini, are you asking about the HDFS move/rename command? Move is purely a metadata operation on the NameNode and does not result in any data movement until the HDFS Mover utility is run. https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html#Mover_-_A_New_Data_Migration_Tool Edit: The Mover will move blocks within the same node when possible, and thus tries to avoid network activity. If that is not possible (e.g., when a node doesn't have an SSD or when the local SSDs are full), it will move block replicas across the network to another node that has the target media.
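To illustrate the full sequence (the paths and the COLD policy below are examples, not from the original thread):

```bash
# Rename is metadata-only; no block data moves
hdfs dfs -mv /data/hot/f1 /data/cold/f1

# Replicas migrate to the target media only after a policy change plus a Mover run
hdfs storagepolicies -setStoragePolicy -path /data/cold -policy COLD
hdfs mover -p /data/cold
```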
04-03-2017
11:14 PM
1 Kudo
You likely have Kerberos enabled. The DataNode process starts as root so that it can bind to a privileged port (&lt;1024) for data transfer, and then launches a second process as the hdfs user. You should not kill either process. The "refused to connect" error looks like a network connectivity issue in your environment, or you are hitting the wrong port number. See if you can find the correct info port, either from the configuration or from the DataNodes tab of the NameNode web UI.
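One quick way to look up the info port from the client configuration (a sketch, assuming your client config matches the cluster's):

```bash
# Print the DataNode web UI (info) address as configured
hdfs getconf -confKey dfs.datanode.http.address
```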
04-03-2017
06:23 PM
1 Kudo
Are you using HDP, and did you enable NameNode HA using Ambari? If so, you should have automatic failover configured. Automatic failover requires the ZooKeeper service instances and the ZKFailoverControllers to be up and running. If you set up HA manually, you may need to transition one of the NameNodes to active status yourself, as described here: https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html
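For a manually configured HA pair (no automatic failover), the transition looks like the sketch below; nn1 is an illustrative NameNode ID, use the IDs from dfs.ha.namenodes.&lt;nameservice&gt; in your hdfs-site.xml:

```bash
# Check the current state, then promote nn1 to active
hdfs haadmin -getServiceState nn1
hdfs haadmin -transitionToActive nn1
```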