I am not sure if and what I should be setting the mapred.reduce.tasks and mapred.map.tasks value too and if it makes a difference. I read that if set to -1 one the correct number of tasks will be calculated?
I have seen this error on occasions in the Job Error Log but am not sure what it means or how to resolve : WARN ResumeXCommand:523 - SERVER[ip-*.*.ec2.internal] USER[hdfs] GROUP[-] TOKEN APP[s3-wf-forked] JOB[0000118-191014051313813-oozie-oozi-W] ACTION E1100: Command precondition does not hold before execution, [workflow's status is RUNNING is not SUSPENDED], Error Code: E1100
As well as this error in the Coordinator which launches the Workflow : No actions to start for jobId=0000117-191014051313813-oozie-oozi-C as max concurrency reached!