2206
Posts
230
Kudos Received
82
Solutions
About
My expertise is not in hadoop but rather online communities, support and social media. Interests include: photography, travel, movies and watching sports.
My Accepted Solutions
Title | Views | Posted |
---|---|---|
440 | 05-07-2025 11:41 AM | |
905 | 02-27-2025 12:49 PM | |
2777 | 06-29-2023 05:42 AM | |
2363 | 05-22-2023 07:03 AM | |
1729 | 05-22-2023 05:42 AM |
07-23-2016
05:07 AM
3 Kudos
I spoke with some of my contacts about this one and here is their response. I hope it helps.
This warning message indicates a potential performance problem which may be occurring for different reasons, from disk/network latency to high CPU load to GC pauses, to mention a few. Based on our earlier experience, I suggest to check/verify the followings: 1. the latency of the network services the Standby NameNode (LDAP/AD, NTP, DNS) uses 2. the possible disk overload (ideally dedicate individual disks to separate the IO loads of the QuorumJournalNode [edit logs storage], NameNode [checkpointing!], and Zookeeper [znode persistency] services), thus the use of NFS mounted storage should be avoided 3. check/verify the GC activity of the Standby NameNode process ('jstat' command, service logs) by running the following two commands in parallel on the Standby NameNode until after you receive another alert in Cloudera Manager: jstat -gc -t -h30 <SBNN JVMPID> 2s jstat -gcutil -t -h30 <SBNN JVMPID> 2s 4. corresponding to the occasionally high GC activity, you may need to increase the heap size on both NameNodes 5. the RPC handler counts should also be set properly to match the occasional large list loads (similar to 'hadoop fsck /'), which could increase latencies if run too often
Generally speaking, the increased RPC latency has two parts, the average time the requests spend in the queue (controlled by the NameNode Handler Count property) and the time needed to process the requests. The length of this latter depends on the performance of the HDFS metadata (edit logs, fsimage) directory. The Cloudera Manager Healt Check alert message contains both the queue and the processing times.
In cases of extremely high activity, such as an attempt to decommission then recommission multiple datanodes or a large number of YARN reducers or Flume/Sqoop data ingestion processes or HBase bulk data load, a lot of edit logs can be generated by the Active Namenode. The process of synchronizing the edits with each JournalNode and sending them to the Standby NameNode and the Standby Namenode checkpointing can be highly I/O hungry. While the Standby NameNode is checkpointing it is not accepting edits from the JournalNodes. The JournalNodes might be having trouble keeping in sync which delayed edits being relayed to the Standby NameNode. This in turn can result in network latencies/delays on the Standby NameNode.
The "rpc_call_queue_len_avg" graphs for the NameNode can also be checked to see if it has any continuous spikes or curves. Ideally that should be 0, indicating that the handlers are sufficient. If not, the value of the 'dfs.datanode.handler.count', the 'dfs.namenode.handler.count' and the 'dfs.namenode.service.handler.count' properties can be bumped. The values of the 'dfs.namenode.handler.count' and the 'dfs.namenode.service.handler.count' both should be the ln (# of cluster nodes)*20 while the 'dfs.datanode.handler.count' is the tenth of these values.
Finally, there can be another special condition when Cloudera Manager health check emits this alert: if the NameNode Health Check interferes with the regular NameNode checkpointing.
... View more
07-22-2016
05:42 AM
The first thing to look at is the amount of RAM allocated to the VM. If you are using Cloudera Manager you need a minimum of 8gb of RAM. Depending on what you are doing with the VM you may need to go above the minimum.
... View more
07-22-2016
04:52 AM
I am happy to hear that you are now up and running. Best of luck.
... View more
07-21-2016
08:42 AM
1 Kudo
If you set up the VM manually, you may have accidentally selected a 32bit OS as several are available.
The easiest way to create the VM is outined in the Community Article How to setup Cloudera Quickstart Virtual Machine.
Using the specific documentation and instructions provided by your hypervisor application, open the extracted file into that hypervisor application. For example, if you elected to use VirtualBox, you would have downloaded and extracted a *.ovf file from Cloudera. Use the “File -> Import Appliance” menu inside VirtualBox to open your downloaded *.ovf file, or simply double-click on the file itself and VirtualBox should handle it from there.
... View more
07-21-2016
05:20 AM
Welcome to the community @sanjeevn. Have you checked your operating system to ensure that it is 64bit as well?
... View more
07-20-2016
05:35 AM
@sim6
Thanks for asking about the ability to bump up posts. The forums are a peer to peer platform but Clouderan's do participate from time to time. If you need a something more, perhaps the Cloudera Developer Program would be a match to your needs.
... View more
07-19-2016
07:04 AM
1 Kudo
Thank you for the question @veeruCloudera. If you look at the CCP Data Scientist Exams page on Cloudera.com you will see that you have to pass all three exams in a 365 day time frame. There are other criteria involved that are also listed on that page. Best of luck.
... View more
07-18-2016
12:26 PM
@Sruthi223 Since this is an older post, it may be benificial to start a new topic for your particular issue as the solution outlined here did not work for you. Make sure to provide all relevent information in the new post.
... View more
07-14-2016
05:10 AM
There is some great discussion here. @singhuda have you resolved the original issue?
... View more