Member since
04-03-2019
962
Posts
1743
Kudos Received
146
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
11427 | 03-08-2019 06:33 PM | |
4866 | 02-15-2019 08:47 PM | |
4152 | 09-26-2018 06:02 PM | |
10544 | 09-07-2018 10:33 PM | |
5592 | 04-25-2018 01:55 AM |
02-14-2017
04:32 PM
2 Kudos
@Bhavesh Gadoya --> Please check if your RM is up and running --> If RM is running, please verify if it's listening on 8088 by running netstat command from RM host. --> If RM is listening on 8088, please see if you are able to connect to RM by remote host on 8088 by telnet --> If you are not able to connect to RM by telnet then there must be some firewall which is blocking port 8088 --> If RM is down, check /var/log/hadoop-yarn/yarn/yarn-yarn-resourcemanager-<hostname>.log
... View more
02-14-2017
03:21 PM
@Jake Kugel - Please have a look at my answer and accept it if your job runs successfully!
... View more
02-14-2017
03:20 PM
1 Kudo
@Jake Kugel
I was able to reproduce this on my local cluster and resolve it. You need to add below property in job.properties file. oozie.action.sharelib.for.sqoop=sqoop,hive Note - Sqoop uses CliDriver class and does not use hive script whereas Oozie was not able to find that class in Classpath hence it was trying to use hive cli. Also, To avoid further issues, please add hive-site.xml in your workflow.xml <file>$some_location_on_hdfs/hive-site.xml#hive-site.xml</file> Credit goes to @pjoseph for finding the reason why this was happening! 🙂
... View more
02-10-2017
02:11 PM
@Ion Drugalea if below answer helped you, please accept the answer 🙂
... View more
02-09-2017
05:49 PM
I did the bit of research and looked into the code and found that currently there is no TIMEOUT parameter on Oozie level. I have raised an internal enhancement request for this. ##Snipped from JavaActionExecutor.java## try {
Element actionXml = XmlUtils.parseXml(action.getConf());
FileSystem actionFs = context.getAppFileSystem();
JobConf jobConf = createBaseHadoopConf(context, actionXml);
jobClient = createJobClient(context, jobConf);
RunningJob runningJob = getRunningJob(context, action, jobClient);
if (runningJob == null) {
context.setExecutionData(FAILED, null);
throw new ActionExecutorException(ActionExecutorException.ErrorType.FAILED, "JA017",
"Unknown hadoop job [{0}] associated with action [{1}]. Failing this action!", action
.getExternalId(), action.getId());
} protected RunningJob getRunningJob(Context context, WorkflowAction action, JobClient jobClient) throws Exception{
RunningJob runningJob = jobClient.getJob(JobID.forName(action.getExternalId()));
return runningJob;
} ##Snippet from Mapreduce code(JobClient.java)## public RunningJob getJob(JobID jobid) throws IOException {
JobStatus status = jobSubmitClient.getJobStatus(jobid);
JobProfile profile = jobSubmitClient.getJobProfile(jobid);
if (status != null && profile != null) {
return new NetworkedJob(status, profile, jobSubmitClient);
} else {
return null;
}
} ##Snippet from JobSubmissionProtocol.java (mapreduce code)## /**
* Grab a handle to a job that is already known to the JobTracker.
* @return Status of the job, or null if not found.
*/
public JobStatus getJobStatus(JobID jobid) throws IOException; So I got answer to my question! 🙂
... View more
02-09-2017
01:08 PM
@Sreeviswa Athikala Can you please let us know what all services you have on to-be-migrated host?
... View more
02-09-2017
12:09 PM
@hardik desai / @Harshit Shah Take backup of below file --> edit it on ranger admin host --> restart ambari-agent and try again #Backup cp /var/lib/ambari-agent/cache/custom_actions/scripts/check_host.py /var/lib/ambari-agent/cache/custom_actions/scripts/check_host.py.bak #Edit vi /var/lib/ambari-agent/cache/custom_actions/scripts/check_host.py #Comment below code # download DBConnectionVerification.jar from ambari-server resources
try:
download_file(check_db_connection_url, check_db_connection_path)
except Exception, e:
message = "Error downloading DBConnectionVerification.jar from Ambari Server resources. Check network access to " \
"Ambari Server.\n" + str(e)
print message
db_connection_check_structured_output = {"exit_code" : 1, "message": message}
return db_connection_check_structured_output #Save and restart ambari agent Let me know the results. This is hacky approach and I would not recommend it for Production clusters. We need to investigate what's happening. If you have support subscription, please open a support case with us 🙂
... View more
02-09-2017
11:19 AM
@hardik desai - Please check updated answer above ^^
... View more
02-09-2017
10:53 AM
@hardik desai Can you please verify connectivity between Ranger admin and Ambari server host? Telnet from Ranger Admin host to Ambari Server on 8080? 503 error code - looks like Ambari Server is too busy to respond or there is network issue between Ranger Admin and Ambari Server. Can you manually copy DBConnectionVerification.jar from ambari-server to ambari-agent on ranger admin host and restart ambari-agent followed by retry DB connection. On Ambari-server DBConnectionVerification.jar should present at below location /var/lib/ambari-server/resources/DBConnectionVerification.jar On Ambari-agent, copy DBConnectionVerification.jar at below location /usr/lib/ambari-agent/DBConnectionVerification.jar
/var/lib/ambari-agent/tmp/DBConnectionVerification.jar
/var/lib/ambari-agent/cache/DBConnectionVerification.jar You can also try to bounce Ambari Server and see if it helps. Please keep us posted. Thanks @Sagar Shimpi
... View more
02-08-2017
12:02 PM
@Predrag Minovic - Yes that's correct. but if JHS is being restarted for some reason and Oozie tries to connect to JHS, jobs will get failed. I'm looking for timeout parameter which can hold jobs until JHS is back
... View more