Created on 05-22-2017 09:29 AM - edited 08-18-2019 02:03 AM
Hello guys!
I'm trying to setup in hortonworks a tutorial from cloudera to create a pipeline to analyse twitter data .
I've managed to successfully configure flume and HDFS and now I'm trying to setup Oozie to run an ALTER TABLE command that adds a partition containing the last hour’s worth of data into Hive. However when I try to validate the process I get stuck in an error. Here's what I'm doing:
1. I create a new workflow and select import asset from shared file system:
2. I select the file hive-action.xml - you can find the source code on github:
3. I fill the forms with the data from coord-app.xml and jobproperties:
4. When I press validate:
What can I possibly be doing wrong here?
Do I also need to create a coordinator?
Many thanks in advance!
Best regards!
Created 05-26-2017 09:01 AM
the problem was in the ip. Make sure you put hdfs:// ...
Created 05-24-2017 06:38 PM
Is this a kerberos enabled cluster?
Also did you configure the oozie proxy user?
https://oozie.apache.org/docs/3.2.0-incubating/AG_Install.html#User_ProxyUser_Configuration
Created 05-25-2017 08:42 AM
Hello Amer and many thanks for your answer!
Kerberos is not enabled.
Hadn't configure the oozie proxy user but now I did (add the 2 properties with * as wildcard) but the problem persists 😞
What else can I be doing wrong? Best regards
Created 05-25-2017 10:47 AM
Managed to solve the problem via command line.
Now it gives me an error related with mapreduce:
org.apache.oozie.action.ActionExecutorException: JA009: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses.
Created 05-26-2017 09:01 AM
the problem was in the ip. Make sure you put hdfs:// ...