Member since
12-15-2015
14
Posts
1
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
4616 | 12-15-2015 08:35 AM |
02-19-2021
03:50 AM
Hello Everyone! I am a Digital Transformation Consultant working at the Cerexio Headquarters since 2015 in Singapore. I am a curious techie trying to find newer ways to helps industrial companies to foster modern technologies and transform themselves to be Industry 4.0 guaranteed companies. Currently, my focus is on working collaboratively with the Cerexio team to create the best MES system in Singapore and we are learning a lot in the process. I consult Singaporean companies about how to digitally transform by every passing minute by using tools and hacks that will help them in future. I am glad I am part of this community... I am looking forward to chatting with you guys soon!
... View more
05-06-2019
05:34 AM
Since the question was asked, the situation has changed. As soon as Hortonworks and Cloudera merged, NiFi became supported by Cloudera. Shortly after the integrations with CDH were also completed, so that NiFi is now a fully supported and integrated component. Please look into the documentation for the latest info at any time, but in general Cloudera Manager is now able to install NiFi.
... View more
07-25-2017
08:09 AM
1 Kudo
hi do you have a step by step detail for the Wildcarding of DNS? I am stuck at that point. I have an AWS EC2 instance on which I have installed the workbench. However i am unable to open the URL.
... View more
12-15-2015
12:26 PM
I'm trying to create a new Cloudera Manager dashboard chart that tracks time series data that's not currently in Cloudera Manager. Is there a way to create new variables that can be read by TSQuery? David Webb
... View more
12-15-2015
09:54 AM
1 Kudo
When I delete a Hue/Oozie workflow, the workspace directory is left behind. I went through several attempts at getting a good Oozie workflow created. Each of these included copying the same jar file to each of the workspace directories. Now I have a bunch of extra workspace directories that are no longer tied to any Hue/Oozie workflow. Is there a configuration setting that will automatically delete the underlying structure when I delete a workflow, or do I need to do this manually? Thanks in advance, David Webb
... View more
Labels:
12-15-2015
08:35 AM
I was able to rebuild the Oozie job and make it work, although I really don't know what is different. I built the job in sequence this time, so that the steps are listed in-sequence in the XML file. I also built the job steps to reference the lib directory in the job's path. I had previously had success with explicit references, but these didn't seem necessary. I moved the prepare steps to a point right before they were needed instead of all on the first step. I eliminated the output directory definition for TeraValidate because it doesn't seem to be used. Finally, I let Hue/Oozie choose the defaults for Master and Mode. I played around with trying to use YARN and cluster, but these didn't work. My resulting XML (that works) looks like this: <workflow-app name="TeraGen-TeraSort-TeraValidate" xmlns="uri:oozie:workflow:0.5"> <start to="spark-27f0"/> <kill name="Kill"> <message>Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message> </kill> <action name="spark-27f0"> <spark xmlns="uri:oozie:spark-action:0.1"> <job-tracker>${jobTracker}</job-tracker> <name-node>${nameNode}</name-node> <prepare> <delete path="${nameNode}/user/davidw/terasort-benchmark.in"/> </prepare> <master>local[*]</master> <mode>client</mode> <name>TeraGen</name> <class>com.github.ehiggs.spark.terasort.TeraGen</class> <jar>lib/spark-terasort.jar</jar> <arg>1g</arg> <arg>/user/davidw/terasort-benchmark.in</arg> </spark> <ok to="spark-94fc"/> <error to="Kill"/> </action> <action name="spark-94fc"> <spark xmlns="uri:oozie:spark-action:0.1"> <job-tracker>${jobTracker}</job-tracker> <name-node>${nameNode}</name-node> <prepare> <delete path="${nameNode}/user/davidw/terasort-benchmark.out"/> </prepare> <master>local[*]</master> <mode>client</mode> <name>TeraSort</name> <class>com.github.ehiggs.spark.terasort.TeraSort</class> <jar>lib/spark-terasort.jar</jar> <arg>/user/davidw/terasort-benchmark.in</arg> <arg>/user/davidw/terasort-benchmark.out</arg> </spark> <ok to="spark-bcf9"/> <error to="Kill"/> </action> <action name="spark-bcf9"> <spark xmlns="uri:oozie:spark-action:0.1"> <job-tracker>${jobTracker}</job-tracker> <name-node>${nameNode}</name-node> <master>local[*]</master> <mode>client</mode> <name>TeraValidate</name> <class>com.github.ehiggs.spark.terasort.TeraValidate</class> <jar>lib/spark-terasort.jar</jar> <arg>/user/davidw/terasort-benchmark.out</arg> </spark> <ok to="End"/> <error to="Kill"/> </action> <end name="End"/> </workflow-app>
... View more
- Tags:
- to re
12-15-2015
07:29 AM
Just a quick update. I also found that if I set the transition on the first step to "End" and then I delete the first step, the saved workflow begins with <Start to="End">
... View more