Our goal is to pull data from a database and push it to HDFS. The data arrives continuously, so we want to pull only the incremental data every hour or so.
NiFi seems like a good choice for that. However, we have found little documentation, so we would like to know whether there are any examples of a data flow from SQL to a log file or to HDFS. More precisely, we want to know which processors we can use.
For example: http://www.slideshare.net/hortonworks/design-a-dataflow-in-7-minutes-58718224 The example there pulls data from Twitter into HDFS. We need the same thing from SQL to HDFS.
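
To make the "incremental" requirement concrete, here is a minimal sketch in plain Python of what we would like each hourly pull to do. This is not NiFi code. The `events` table, its `id` and `payload` columns, and the `fetch_increment` helper are all hypothetical, and an in-memory SQLite database stands in for our real DB.

```python
import sqlite3


def fetch_increment(conn, last_id):
    """Return rows with id greater than last_id, plus the new high-water mark."""
    rows = conn.execute(
        "SELECT id, payload FROM events WHERE id > ? ORDER BY id",
        (last_id,),
    ).fetchall()
    # If nothing new arrived, keep the previous high-water mark.
    new_last_id = rows[-1][0] if rows else last_id
    return rows, new_last_id


# Hypothetical source table; an in-memory SQLite DB stands in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany("INSERT INTO events (payload) VALUES (?)", [("a",), ("b",)])

# First pull: everything after the initial mark of 0.
first_batch, mark = fetch_increment(conn, 0)

# A new row arrives before the next scheduled pull.
conn.execute("INSERT INTO events (payload) VALUES (?)", ("c",))

# Second pull: only the row added since the last mark.
second_batch, mark = fetch_increment(conn, mark)
```

In other words, each run should remember the highest value it has already seen in some increasing column and fetch only newer rows. We are looking for the NiFi processors that give us this behaviour and then write the result to HDFS.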