Member since: 06-26-2013
Posts: 416
Kudos Received: 104
Solutions: 49
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 6804 | 03-23-2016 08:06 AM
 | 11679 | 10-12-2015 01:56 PM
 | 3979 | 03-05-2015 11:11 AM
 | 5577 | 02-19-2015 02:41 PM
 | 10564 | 01-26-2015 09:55 AM
11-04-2013
02:22 PM
You are correct. It's either the NFS-based shared edits directory OR the QJM-based HA config.
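For illustration, the mutual exclusivity shows up in dfs.namenode.shared.edits.dir itself, which points at an NFS mount in one mode and at a JournalNode quorum in the other. The paths, hostnames, and nameservice ID below are placeholders, not values from your cluster:

```xml
<!-- NFS-based HA: shared edits directory on an NFS mount visible to both NameNodes. -->
<property>
  <name>dfs.namenode.shared.edits.dir</name>
  <value>file:///mnt/nfs/shared-edits</value>
</property>

<!-- QJM-based HA: the same property instead points at a quorum of JournalNodes. -->
<property>
  <name>dfs.namenode.shared.edits.dir</name>
  <value>qjournal://jn1:8485;jn2:8485;jn3:8485/mycluster</value>
</property>
```

You configure one form or the other, never both.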
11-04-2013
01:53 PM
Yes, that's the way the process still happens. Once you get the HDFS service installed and running, setting up HA is a separate workflow allowing you to choose your fencing mechanism, manual or automatic failover, quorum journal nodes, etc.
I believe this is the doc you will need.
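For reference, the choices that the HA workflow walks you through correspond roughly to hdfs-site.xml properties like the ones below; this is only a sketch, not the exact configuration CM will generate:

```xml
<property>
  <!-- true = automatic failover via ZKFC; false = manual failover only. -->
  <name>dfs.ha.automatic-failover.enabled</name>
  <value>true</value>
</property>
<property>
  <!-- Fencing method used during failover; sshfence and shell(...) are the built-ins. -->
  <name>dfs.ha.fencing.methods</name>
  <value>sshfence</value>
</property>
```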
11-01-2013
05:14 PM
5 Kudos
@happynodes I have moved your thread to the Cloudera Manager board, because you mentioned that you were using Parcels, and as far as I know that is a CM-specific packaging model.
To answer your question, CM has settings in many places to control the size and number of log files that are retained. Please be aware that every Hadoop service, as well as the Cloudera Manager management/monitoring services, keeps its own logs, and by default those end up in /var/log.
For example, browse to your hdfs1 service page and click "Configuration->View and Edit". On the left-hand side, you can expand several menus and find "Logs" sections that let you configure the logging of that service; DataNode, Failover Controller, and NameNode are a few examples.
What I would recommend is to identify which directory under /var/log is the actual culprit here, and then go into CM and adjust the log retention settings for that service.
10-31-2013
10:22 AM
1 Kudo
The hdfs-site.xml file that you are viewing in CM which @smark helped you to find, resides on the local filesystem of that remote datanode. It will not be in /etc/hadoop/conf, though (unless you re-deploy your client configs to that machine), as CM maintains its own configuration directory in /var/run/cloudera-scm-agent/process for the roles that it manages. You will find the hdfs-site.xml file under that directory in the latest ???-Datanode directory.
10-30-2013
08:25 AM
Thank you for the report, @ManishChopra. We had a temporary portal issue this morning, but it has now been resolved. Your feedback is very much appreciated!
10-28-2013
09:23 AM
Could it have something to do with old cache files that were not cleaned out from before you made the change? I think there is a mechanism for retiring these old files and moving them off or deleting them, but I'm not positive whether that applies to the actual jobcache files.
Maybe this blog contains the clue?
http://blog.cloudera.com/blog/2010/11/hadoop-log-location-and-retention/
10-28-2013
08:54 AM
1 Kudo
When you are running on a single machine, you must set the replication factor (dfs.replication) to 1. The default is 3, and since there are not 3 DataNodes in your cluster, HDFS will just sit there trying to replicate blocks that it cannot. See below from your fsck output:
Default replication factor: 3
Under-replicated blocks: 126 (100.0 %)
If you restart the cluster with replication set to one, the cluster should report healthy again.
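In case it helps, the relevant hdfs-site.xml entry would look something like this (a minimal sketch, nothing else changed):

```xml
<!-- Single-node cluster: keep only one copy of each block. -->
<property>
  <name>dfs.replication</name>
  <value>1</value>
</property>
```

Keep in mind that dfs.replication is applied when a file is written, so files created before the change keep their original replication factor unless it is lowered explicitly (for example with hadoop fs -setrep).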
10-21-2013
01:54 PM
Do you have the hbase.zookeeper.quorum property I mentioned previously in your /etc/hbase/conf/hbase-site.xml file on these systems? It sounds like your HBase clients (any app trying to access the HBase service) are trying to use the default value for the ZK quorum, which would have them looking on localhost for a ZK server. This is why it works on nodes that are running a ZK instance. You need a valid hbase-site.xml file on each node that specifies the ZK quorum, as described in the link I posted; a sketch follows below. I hope that helps.
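As a rough example (the hostnames are placeholders for whichever nodes actually run ZooKeeper), the hbase-site.xml entry looks like this:

```xml
<!-- Placeholders: list the hosts in your ZooKeeper quorum. -->
<property>
  <name>hbase.zookeeper.quorum</name>
  <value>zk-host1,zk-host2,zk-host3</value>
</property>
<property>
  <!-- Only needed if ZooKeeper is not on the default client port 2181. -->
  <name>hbase.zookeeper.property.clientPort</name>
  <value>2181</value>
</property>
```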
10-21-2013
12:26 PM
Can you give a bit more detail as to what you are doing when you encounter this error? And is the machine where you are seeing this one of those 8 nodes in the cluster? Or an external machine?
I've seen this before when a client app outside the cluster was unable to connect to the ZooKeeper quorum because a local copy of the hbase-site.xml file was not in the application's path; it therefore did not know who the ZooKeeper servers were, and the error looked like yours. The property that needs to be specified for the client is hbase.zookeeper.quorum.
http://hbase.apache.org/book/zookeeper.html
10-16-2013
12:34 PM
Thanks for closing the loop with us and posting back the solution, JakeZ.