Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Celebrating as our community reaches 100,000 members! Thank you!
Labels (2)
avatar
Master Mentor

Part 1: https://community.hortonworks.com/articles/82964/getting-started-with-apache-ambari-workflow-design....

Part 2: https://community.hortonworks.com/articles/82967/apache-ambari-workflow-designer-view-for-apache-oo....

Part 3: https://community.hortonworks.com/articles/82988/apache-ambari-workflow-designer-view-for-apache-oo-...

Part 4: https://community.hortonworks.com/articles/83051/apache-ambari-workflow-designer-view-for-apache-oo-...

Part 5: https://community.hortonworks.com/articles/83361/apache-ambari-workflow-manager-view-for-apache-ooz....

Part 7: https://community.hortonworks.com/articles/84071/apache-ambari-workflow-manager-view-for-apache-ooz-...

Part 8: https://community.hortonworks.com/articles/84394/apache-ambari-workflow-manager-view-for-apache-ooz-...

Part 9: https://community.hortonworks.com/articles/85091/apache-ambari-workflow-manager-view-for-apache-ooz-...

Part 10: https://community.hortonworks.com/articles/85354/apache-ambari-workflow-manager-view-for-apache-ooz-...

Part 11: https://community.hortonworks.com/articles/85361/apache-ambari-workflow-manager-view-for-apache-ooz-...

Part 12: https://community.hortonworks.com/articles/131389/apache-ambari-workflow-manager-view-for-apache-ooz...

In this tutorial, we're going to leverage Oozie's SLA monitoring features via Workflow Manager. To read more about SLA features in Oozie, please look at the official documentation https://oozie.apache.org/docs/4.2.0/DG_SLAMonitoring.html

We will begin with a simple shell action that will sleep for duration of time. Create a new file called script.sh and paste add the code below.

echo “start of script execution”
sleep 60
echo “end of script execution”

We're also going to create a workflow HDFS directory and upload this script to it.

hdfs dfs –mkdir oozie/shell-sla
hdfs dfs -put script.sh oozie/shell-sla/

Let's begin with adding a shell action and populating the script name and file attribute.

12531-1-shell-action.png

Don't forget to check the capture output box. We want to see the output of this action.

12532-2-capture-output.png

I want to submit the workflow to make sure everything works as expected before configuring SLA features.

12533-3-submit-wf.png

It's a good idea to preview the XML to make sure file tag and exec tags are filled correctly.

12534-4-preview-xml.png

Once job completes, I want to drill down to the job and view the output.

12535-5-review-job-result.png

Everything looks good, we're ready to enable SLA features of Oozie via Workflow Manager. Click on the shell action and then gear icon. At the bottom of the configuration page, you will see SLA section. Expand that and check the enabled box.

12536-8-enable-sla-wfm.png

Each field is described as below:

  • nominal-time: As the name suggests, this is the time relative to which your jobs' SLAs will be calculated. Generally since Oozie workflows are aligned with synchronous data dependencies, this nominal time can be parameterized to be passed the value of your coordinator nominal time. Nominal time is also required in case of independent workflows and you can specify the time in which you expect the workflow to be run if you don't have a synchronous dataset associated with it.
  • should-start: Relative to nominal-time this is the amount of time (along with time-unit - MINUTES, HOURS, DAYS) within which your job should start running to meet SLA. This is optional.
  • should-end: Relative to nominal-time this is the amount of time (along with time-unit - MINUTES, HOURS, DAYS) within which your job should finish to meet SLA.
  • max-duration: This is the maximum amount of time (along with time-unit - MINUTES, HOURS, DAYS) your job is expected to run. This is optional.
  • alert-events: Specify the types of events for which Email alerts should be sent. Allowable values in this comma-separated list are start_miss, end_miss and duration_miss. *_met events can generally be deemed low priority and hence email alerting for these is not necessary. However, note that this setting is only for alerts via email alerts and not via JMS messages, where all events send out notifications, and user can filter them using desired selectors. This is optional and only applicable when alert-contact is configured.
  • alert-contact: Specify a comma separated list of email addresses where you wish your alerts to be sent. This is optional and need not be configured if you just want to view your job SLA history in the UI and do not want to receive email alerts.

I'm going to simulate each one of the SLA patterns, i.e. my job started later than scheduled, my job completed outside the SLA threshold and finally, my job took longer to complete than we were expecting. To fill out the nominal time, feel free to choose the date and clock icon below the date picker for correct time. Click x when ready.

12538-29-clock-picker2.png

Finally, I'd like to change my script to run for 120 seconds instead of 60 to simulate long duration.

My script should look like so:

echo “start of script execution”
sleep 120
echo “end of script execution”

When ready re-upload the script.

At this point, I want to make sure sending mail from the cluster is possible and will test that by sending a sample email. Enabling mail is beyond the scope of this tutorial, I followed the procedure below, adjust as necessary for your environment.

sudo su
yum install postfix
/etc/init.d/postfix restart
exit

Now we're able to send mail from our node, mail needs to work on any of the nodes Oozie will execute a wf.

mail -s "test" email@email.com

hit ctrl-D, you should get an email shortly.

Finally, there are some changes we need to implement on the Oozie side. I'm not going to enable JMS alerting and only concentrate on email piece. Please consult Oozie docs for JMS part.

This is HDP 2.5.3 and things may look/act differently on your Oozie instance.

Let's go to Ambari > Configs

filter by the following property

oozie.services.ext

We're going to add these services to the existing list:

org.apache.oozie.service.EventHandlerService, 
org.apache.oozie.sla.service.SLAService

12539-9-add-oozie-property1.png

Once ready, add a couple of more custom properties in Oozie, again, in my environment these properties did not exist.

oozie.service.EventHandlerService.event.listeners

and the value should be

org.apache.oozie.sla.listener.SLAJobEventListener,
org.apache.oozie.sla.listener.SLAEmailEventListener

12540-11-add-oozie-property2.png

Oozie docs also recommend adding the following property to improve performance of event processing, we're going to add this property and set value to 15.

oozie.service.SchedulerService.threads

12541-12-add-oozie-property3.png

Once I saved the changes and restarted Oozie, it failed to start, looking at the logs I noticed the following in the oozie-error.log:

2017-02-15 18:14:32,757  WARN ConfigUtils:523 - SERVER[wfmanager-test-1.openstacklocal] Using a deprecated configuration property [oozie.service.AuthorizationService.security.enabled], should use [oozie.service.AuthorizationService.authorization.enabled].  Please delete the deprecated property in order for the new property to take effect.

I found the property in Ambari > Configs and set it to false, I was not able to delete it.

12542-15-disable-oozie-property.png

Once done, restart all Oozie services and now you're able to see a new tab in Oozie called SLA

12543-16-oozie-sla-tab.png

Remember, we only configured Email service, not JMS. We're ready to test our wf. Before that, I'd like to preview the XML for good measure.

12544-27-preview-sla-xml.png

At this point, I'm ready to submit the workflow and watch my inbox. I'm expecting to miss my job start, job end and duration. This is an email output of my workflow.

12546-20-start-miss.png

12545-18-end-miss.png

12547-21-duration-miss.png

Until next time folks!


29-clock-picker.png
4,062 Views