Everything has worked smoothly for the most part until I reached the section about running an example application with YARN:
Run an example Hadoop job to grep with a regular expression in your input data.
$ hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar grep input output23 'dfs[a-z.]+'
At this point I get the following warning messages and the job hangs:
$ hadoop jar /usr/lib/hadoop-mapreduce/hadoop-mapreduce-examples-2.6.0-cdh5.6.0.jar grep input output23 'dfs[a-z.]+'
16/03/15 09:28:58 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
16/03/15 09:29:00 WARN mapreduce.JobResourceUploader: No job jar file set. User classes may not be found. See Job or Job#setJar(String).
16/03/15 09:29:00 INFO input.FileInputFormat: Total input paths to process : 4
16/03/15 09:29:01 INFO mapreduce.JobSubmitter: number of splits:4
16/03/15 09:29:01 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1458035550745_0007
16/03/15 09:29:01 INFO mapred.YARNRunner: Job jar is not present. Not adding any jar to the list of resources.
16/03/15 09:29:01 INFO impl.YarnClientImpl: Submitted application application_1458035550745_0007
16/03/15 09:29:02 INFO mapreduce.Job: The url to track the job: http://fembuntu:8088/proxy/application_1458035550745_0007/
16/03/15 09:29:02 INFO mapreduce.Job: Running job: job_1458035550745_0007
I am not sure how I can resolve this. I tried setting and unsetting HADOOP_MAPRED_HOME and a few other things but to no avail. I also took a look at the url to track the job and all I could see was that the apps were in a pending state and nothing was running. What am I missing?
Cluster Metrics Apps Submitted Apps Pending Apps Running Apps Completed Containers Running Memory Used Memory Total Memory Reserved VCores Used VCores Total VCores Reserved Active Nodes Decommissioned Nodes Lost Nodes Unhealthy Nodes Rebooted Nodes