About cjervis

cjervis · ‎08-18-2016

Looking over the CCA Spark and Hadoop Developer Certification page at the bottom is a exam delivery and cluster information section with the following information. All other websites, including Google/search functionality is disabled. You may not use notes or other exam aids.

cjervis · ‎08-17-2016

Sorry about the delay in responding @Megh. If you look at the DE575 certification page on the Cloudera website you can see that the cluster setup includes Cloudera HUE.

cjervis · ‎08-12-2016

I'm happy to see that you were able to resolve the issue. I'm also impressed with your use of the profile card option. It really makes you stand out in the crowd. 🙂

cjervis · ‎07-29-2016

@DevenV I'll reach out to one of my contacts in training and see if they can respond.

cjervis · ‎07-29-2016

Great! Best of luck going forward. 🙂

cjervis · ‎07-28-2016

Did you create the VM and point it to the quickstart file or did you use import appliance as mentioned in step 4 of our community article How to setup a Cloudera Quickstart Virtual Machine?\ Using the specific documentation and instructions provided by your hypervisor application, open the extracted file into that hypervisor application. For example, if you elected to use VirtualBox, you would have downloaded and extracted a *.ovf file from Cloudera. Use the “File -> Import Appliance” menu inside VirtualBox to open your downloaded *.ovf file, or simply double-click on the file itself and VirtualBox should handle it from there.

cjervis · ‎07-23-2016

I spoke with some of my contacts about this one and here is their response. I hope it helps. This warning message indicates a potential performance problem which may be occurring for different reasons, from disk/network latency to high CPU load to GC pauses, to mention a few. Based on our earlier experience, I suggest to check/verify the followings: 1. the latency of the network services the Standby NameNode (LDAP/AD, NTP, DNS) uses 2. the possible disk overload (ideally dedicate individual disks to separate the IO loads of the QuorumJournalNode [edit logs storage], NameNode [checkpointing!], and Zookeeper [znode persistency] services), thus the use of NFS mounted storage should be avoided 3. check/verify the GC activity of the Standby NameNode process ('jstat' command, service logs) by running the following two commands in parallel on the Standby NameNode until after you receive another alert in Cloudera Manager: jstat -gc -t -h30 <SBNN JVMPID> 2s jstat -gcutil -t -h30 <SBNN JVMPID> 2s 4. corresponding to the occasionally high GC activity, you may need to increase the heap size on both NameNodes 5. the RPC handler counts should also be set properly to match the occasional large list loads (similar to 'hadoop fsck /'), which could increase latencies if run too often Generally speaking, the increased RPC latency has two parts, the average time the requests spend in the queue (controlled by the NameNode Handler Count property) and the time needed to process the requests. The length of this latter depends on the performance of the HDFS metadata (edit logs, fsimage) directory. The Cloudera Manager Healt Check alert message contains both the queue and the processing times. In cases of extremely high activity, such as an attempt to decommission then recommission multiple datanodes or a large number of YARN reducers or Flume/Sqoop data ingestion processes or HBase bulk data load, a lot of edit logs can be generated by the Active Namenode. The process of synchronizing the edits with each JournalNode and sending them to the Standby NameNode and the Standby Namenode checkpointing can be highly I/O hungry. While the Standby NameNode is checkpointing it is not accepting edits from the JournalNodes. The JournalNodes might be having trouble keeping in sync which delayed edits being relayed to the Standby NameNode. This in turn can result in network latencies/delays on the Standby NameNode. The "rpc_call_queue_len_avg" graphs for the NameNode can also be checked to see if it has any continuous spikes or curves. Ideally that should be 0, indicating that the handlers are sufficient. If not, the value of the 'dfs.datanode.handler.count', the 'dfs.namenode.handler.count' and the 'dfs.namenode.service.handler.count' properties can be bumped. The values of the 'dfs.namenode.handler.count' and the 'dfs.namenode.service.handler.count' both should be the ln (# of cluster nodes)*20 while the 'dfs.datanode.handler.count' is the tenth of these values. Finally, there can be another special condition when Cloudera Manager health check emits this alert: if the NameNode Health Check interferes with the regular NameNode checkpointing.

cjervis · ‎07-22-2016

I sent you a PM for further information @ben123. 🙂

cjervis · ‎07-22-2016

The first thing to look at is the amount of RAM allocated to the VM. If you are using Cloudera Manager you need a minimum of 8gb of RAM. Depending on what you are doing with the VM you may need to go above the minimum.

cjervis · ‎07-22-2016

I am happy to hear that you are now up and running. Best of luck.

Online	Online
Last Visited	‎01-28-2026 08:32 AM

Name	Cy Jervis
Location	Lecanto, Fl
Member Since	‎04-06-2015 02:01 PM
Last Visited	‎01-28-2026 08:32 AM
Posts	2,213
Total Tags	1742
Kudos received	199

Cloudera Community

Re: Partner Developer License Request

Re: Where are Cloudera Blogs?

Re: Apache Nifi: how to ensure record order betwee...

Re: Nifi does not read data from the topic KAFKA o...

Re: Automate the deployment of a model

Re: Can I use google when taking CCA exams?

Re: Hue Web UI is available for CCP Data Engineer ...

Re: How to add Flume as a service in CDH 5.7.0

Re: Considering signing up for Cloudera on-Demand ...

Re: CDH 5.7 doesn't launch on VirtualBox5.1

Re: CDH 5.7 doesn't launch on VirtualBox5.1

Re: Standby Namenode is getting RPC latency bad he...

Re: change account email

Re: Cloudera QuickStart VM - Performance Issues - ...

Re: cloudera-quickstart-vm-5.7.0-0-virtualbox not ...