Created 02-09-2016 05:40 PM
Hi,
I am new to HDF and have few queries on HDF and its configuration. Can anyone please answer my below queries.
Thanks
Created 02-09-2016 05:43 PM
Great question
You can find really useful info here
Created 02-09-2016 05:43 PM
Great question
You can find really useful info here
Created 02-09-2016 05:45 PM
1. Nifi is not a replacement for Oozie, you can't schedule jobs though you can run cron commands and execute shell commands within Nifi. It's not a start and stop operation, it continuously runs until you explicitly stop it. You can take a look at rest api to start and stop workflow if that's what you're asking. In the next release, nifi will have scripting capabilities so essentially you can execute groovy, shell, maybe python and maybe pig but I cannot comment on the last two.
2. https://community.hortonworks.com/content/kbentry/886/securing-nifi-step-by-step.html
3. file a jira
Created 02-11-2016 08:36 PM
@Artem Ervits Thanks for the info. For the first query, my intention was not to see whether Nifi works as a Oozie replacement, but to see how to get functionality like oozie in HDF world. On further reading, I found out that at each processor level, I can have scheduling (timer based, cron based or event based etc). This is sufficient for our requirements.
For security, I need to look into it deeper. Will come back later with further queries.
Many Thanks
Created 02-11-2016 08:45 PM
@Neeraj Sabharwal @Artem Ervits
Just wondering what is the best mechanism to ingest data from relational sources into HDP. To use a combination of ExecuteSQL and putHDFS processors or to use Sqoop and deliver the data to HDP?
Many Thanks