Created 07-04-2017 06:07 AM
One of my workflows in oozie is very important and I would always like it to be allocated the reesources it needs before anything else.
This workflow has a sqoop job and hive queries. What do I need to do to assign VERY_HIGH priority to this workflow . A direction would be appreciated. I have tried looking it up but still pretty clueless about "how" to do it
Created 07-04-2017 06:28 AM
In the capacity scheduler you could set a high priority queue providing the maximum resources of the cluster with extension( max. Capacity) to 100%. And while running the oozie workflow, you can provide the queue name in Job.properties and workflow.xml so that the job is always submitted to a high priority queue.
queueName=<queue-name>
<configuration> <property> <name>mapred.job.queue.name</name> <value>${queueName}</value> </property> </configuration>
Created 07-04-2017 11:05 AM
@Shashank Chandhok: Thank you. I have done that and now my job is being submitted to a queue that is not pre-empted . But I would also like to set priority of the job. I tried setting SET mapred.job.priority in hive query itself in the editor but even then these are submitted with NORMAL priority as I see through hadoop job -list
Created 07-04-2017 07:12 PM
@Simran Kaur I believe currently oozie does not support the functionality of tagging jobs with prioirities Check the following links for more details
Created 07-04-2017 06:30 AM
You can create separate yarn queue for these jobs. You can use capacity scheduler or Yarn queue manager view in Ambari .
Now you can use this queue for your high priority oozie jobs
https://stackoverflow.com/questions/32438052/job-queue-for-hive-action-in-oozie
Kindly let me know if this helps.
Created 07-04-2017 11:05 AM
@prsinghThank you. I have done that and now my job is being submitted to a queue that is not pre-empted . But I would also like to set priority of the job. I tried setting SET mapred.job.priority in hive query itself in the editor but even then these are submitted with NORMAL priority as I see through hadoop job -list
Created 07-04-2017 11:12 AM
Please try with "mapreduce.job.priority" as "mapred.job.priority" is deprecated now.
Created 07-04-2017 12:11 PM
I did. Nothing changed