Member since: 06-26-2013
Posts: 416
Kudos Received: 104
Solutions: 49

My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
|  | 7742 | 03-23-2016 08:06 AM |
|  | 13827 | 10-12-2015 01:56 PM |
|  | 4925 | 03-05-2015 11:11 AM |
|  | 6148 | 02-19-2015 02:41 PM |
|  | 13472 | 01-26-2015 09:55 AM |
05-05-2014 11:37 AM
Thanks for reporting the solution back to this thread, Murthy. Glad it's resolved!
04-16-2014 11:51 AM
Finally resolved the issue. Due to a space crunch on the root mount, we had created a soft link so that /opt/cloudera/ points to /data/opt/cloudera/. As a result, the local repo path changed as well, so while installing packages CM kept trying to download them again and again but could not find them on disk, even though it had distributed them correctly. Once I changed the local repo directory in CM Administration (parcel-repo path), everything started working as required. Thanks everyone for your support and suggestions. Priyabrata Patnaik
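For anyone hitting the same thing, a minimal shell sketch of the workaround described above; the paths come from this post, and the exact Cloudera Manager setting name may vary by version:

```bash
# Assumption: /opt is short on space, so the Cloudera directory lives on /data instead.
mkdir -p /data/opt
mv /opt/cloudera /data/opt/cloudera        # relocate the existing contents first
ln -s /data/opt/cloudera /opt/cloudera     # /opt/cloudera now points at the new location

# Then update the local parcel repository path in Cloudera Manager's Administration
# settings so CM looks for parcels under the relocated directory instead of
# re-downloading them on every install attempt.
```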
04-07-2014 08:26 AM
So one more question I had: is it purely a non-functional performance consideration based on workloads? Is it ever a concern that any of the software components in the Cloudera stack would actually cause job failures (or, even worse, successful completions that produce a corrupt dataset) through mixing, say, bonded 1GE and 10GE racks of servers? We're running HBase, MapReduce and very light Impala on our cluster of over 60 nodes, and we're thinking of moving to 10GE for nodes 60-100. But we're not sure if we should also upgrade the existing 60 nodes. We'll do some investigation now to determine whether our jobs are network bound, but there doesn't seem to be an easy way of measuring this other than through the Chart views, looking at total bytes received on all interfaces over time on each node. Any other suggestions? Would anyone recommend that, in order to move to 10GE networking, all potential components of the solution MUST be upgraded? Or is it purely a call to be made based on the performance attributes of the jobs running?
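As a rough alternative to the Chart views, a minimal sketch of sampling per-node interface throughput from the shell; it assumes the sysstat package is installed and that the bonded interface is named bond0 (both assumptions):

```bash
# Sample receive/transmit throughput on bond0 every 5 seconds for one minute;
# sustained rates near the link's capacity suggest the jobs are network bound.
sar -n DEV 5 12 | awk '$2 == "bond0" { print $1, "rx:", $5, "kB/s", "tx:", $6, "kB/s" }'
```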
04-02-2014 10:05 AM
Cloudera Manager is proprietary software; its source code is not available. Sorry!
04-01-2014 02:59 AM
Hi, just to follow up on this, I have now solved the problem. There were two things that I needed to do:

1. In addition to adding oozie.libpath to my job.properties, I also needed to include oozie.use.system.libpath=true.
2. Before, I was using the following to add files to the DistributedCache:

```java
// List everything under /application/lib and add each file to the job classpath.
FileStatus[] status = fs.listStatus(new Path("/application/lib"));
if (status != null) {
    for (int i = 0; i < status.length; ++i) {
        if (!status[i].isDir()) {
            DistributedCache.addFileToClassPath(status[i].getPath(), job.getConfiguration(), fs);
        }
    }
}
```

This appeared to be causing a classpath issue because it was adding hdfs://hostname before the HDFS path. Now I am using the following to strip that prefix and add only the absolute HDFS path:

```java
FileStatus[] status = fs.listStatus(new Path("/application/lib"));
if (status != null) {
    for (int i = 0; i < status.length; ++i) {
        if (!status[i].isDir()) {
            // Drop the hdfs://hostname scheme and authority, keeping only the path component.
            Path distCachePath = new Path(status[i].getPath().toUri().getPath());
            DistributedCache.addFileToClassPath(distCachePath, job.getConfiguration(), fs);
        }
    }
}
```

Thank you to those who replied to my original query for pointing me in the right direction. Andrew
03-31-2014 01:14 AM
I have set "yarn.nodemanager.delete.debug-delay-sec" to 6000, and the container log dirs are configured as:

```xml
<property>
  <name>yarn.nodemanager.log-dirs</name>
  <value>/hadoop/hadoop-2.0.0-cdh4.5.0/yarn/containers</value>
</property>
<property>
  <description>Where to aggregate logs</description>
  <name>yarn.nodemanager.remote-app-log-dir</name>
  <value>/var/log/hadoop-yarn/app</value>
</property>
```

The directory /hadoop/hadoop-2.0.0-cdh4.5.0/yarn/containers has nothing in it after running the task; I found the configuration in yarn-site.xml never takes effect.
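One way to narrow this down is to check whether the running NodeManager actually loaded the yarn-site.xml that was edited; a quick shell sketch, where the /etc/hadoop/conf path is only an assumption (a CM-managed NodeManager reads its config from a per-process directory instead):

```bash
# Find the NodeManager process and inspect the config/classpath entries on its command line.
NM_PID=$(pgrep -f 'org.apache.hadoop.yarn.server.nodemanager.NodeManager' | head -n 1)
tr '\0' '\n' < /proc/"$NM_PID"/cmdline | grep -i -E 'conf|yarn'

# Then confirm the delay value in the config directory the process is really using
# (replace the path below with whatever the command line above points at).
grep -A 1 'yarn.nodemanager.delete.debug-delay-sec' /etc/hadoop/conf/yarn-site.xml
```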
03-30-2014 05:36 PM
For the benefit of others that may encounter this, the root cause of this problem was eventually identified. The problem was caused by the SSH-client-launched remote command running under a much older version of "bash", a version that had the (temporary) problem of not exporting the SSH_CLIENT variable to the environment.

How can this happen and be obscure? It turns out that when CM executes "ssh 'bash -c ...'", the remote SSH server relies on a static search PATH to locate "bash", which may be different from the PATH you pick up in interactive shells. To check if you have this (unlikely) problem, run this from a machine remote from the target machine:

```
$ ssh you@yourmachine.com 'which bash'
/usr/local/bin/bash
$ ssh you@yourmachine.com 'bash --version'
GNU bash, version 2.05.8(1)-release (i386-redhat-linux-gnu)
$ ssh you@yourmachine.com 'env | grep SSH_CLIENT'
SSH_CLIENT=10.1.2.3 56617 22
$ ssh you@yourmachine.com 'bash -c "env | grep SSH_CLIENT"'
(nothing)
```

Note the really old version of bash reported here for me, and the non-standard path. When "bash" is then explicitly invoked and SSH_CLIENT is checked, it is missing. You can compare this to the results from an interactive shell session. The version of bash above, and some other versions from around the same time, do not correctly export SSH_CLIENT. The fix is to eliminate the bad version of bash from the target machine. Brett
03-27-2014 07:27 AM
No problem at all, welcome to our community!
03-26-2014 03:52 PM
Hey Clint, thanks man. I did create the solr user and the installation worked. I am trying to figure out why it did not create the user.
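For reference, a minimal sketch of creating the service account by hand when the package does not; the home directory and shell below are illustrative assumptions, not necessarily what the Cloudera packages use:

```bash
# Create a system group and a non-login system user for Solr (illustrative values).
sudo groupadd -r solr
sudo useradd -r -g solr -d /var/lib/solr -s /sbin/nologin -c "Solr service account" solr
```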
03-19-2014 08:54 AM
1 Kudo
It was a problem with specifying the TNS name. Got that resolved. Thank you all.