Created 12-30-2016 02:59 PM
Created 01-02-2017 02:50 PM
I need to tag an arbitrary HDFS file using the attribute below. How can I do it? Please share any ideas on this.
Created 01-04-2017 01:19 AM
How do you ingest this data? Can you describe your use case? This task is easy to do with Apache NiFi, since it processes data in flight. I have always found Oozie challenging for this on secure clusters. In your case I'd look at the shell action, but you'd have to proxy your hdfs user. The Oozie FS action would be easier to use, but it does not support a setfattr operation in the current release. Feel free to file an Apache Jira on that. https://oozie.apache.org/docs/4.2.0/WorkflowFunctionalSpec.html#a3.2.4_Fs_HDFS_action
Again, after considering all the options in Oozie, I'd try NiFi first.
Created 01-05-2017 10:41 AM
Thanks for your reply. Our data ingestion process is a little complex: a lot of preprocessing needs to be done before tagging incoming HDFS files from external sources. Apache NiFi is cool stuff, and you suggested almost every possible way to do it. But for this scenario I will go with a Java action instead of a shell action, because I may need to get the metadata from different sources, which is easy to do in Java. Thanks again for sharing your thoughts.
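For anyone following along, here is a rough sketch of the tagging step, not a definitive implementation. It only builds the `hdfs dfs -setfattr` command line that a shell or Java action could run, and checks that the attribute name carries one of the HDFS xattr namespace prefixes. The class name `SetfattrCommand`, the path, and the attribute `user.source` are made up for illustration. Inside a real Java action, calling Hadoop's `FileSystem.setXAttr` directly would probably be the more natural route; this version sticks to the JDK so it runs without a cluster.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper: assembles the argument list for `hdfs dfs -setfattr`.
final class SetfattrCommand {
    // HDFS extended attribute names must start with one of these namespaces.
    private static final List<String> NAMESPACES =
            Arrays.asList("user.", "trusted.", "system.", "security.", "raw.");

    static List<String> build(String path, String name, String value) {
        String lower = name.toLowerCase(Locale.ROOT);
        boolean hasNamespace = false;
        for (String ns : NAMESPACES) {
            // Require a non-empty attribute name after the namespace prefix.
            if (lower.startsWith(ns) && lower.length() > ns.length()) {
                hasNamespace = true;
            }
        }
        if (!hasNamespace) {
            throw new IllegalArgumentException(
                    "xattr name needs a namespace prefix such as user.: " + name);
        }
        List<String> cmd = new ArrayList<>(
                Arrays.asList("hdfs", "dfs", "-setfattr", "-n", name));
        if (value != null) {
            // -v is optional; an attribute may be set without a value.
            cmd.add("-v");
            cmd.add(value);
        }
        cmd.add(path);
        return cmd;
    }

    public static void main(String[] args) {
        // Dry run: print the command instead of executing it.
        List<String> cmd = build("/data/incoming/sample.csv", "user.source", "vendorA");
        System.out.println(String.join(" ", cmd));
        // prints: hdfs dfs -setfattr -n user.source -v vendorA /data/incoming/sample.csv
    }
}
```

To actually run it, the list could be handed to `new ProcessBuilder(cmd)` on a node where the `hdfs` client is on the PATH and the calling user has write access to the file.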
Created 01-06-2017 01:01 PM
That also works. I'm going to look at the effort needed to contribute an enhancement to the FS action that adds this functionality. It seems only a few FS operations were implemented, whereas many more are available through the hdfs shell.