Created 03-22-2016 05:37 PM
What are the best practices for copying data between two clusters located in different datacenters on different LANs? The goal is to limit loops.
Created 03-22-2016 09:02 PM
Today the client uses a couple of staging/FTP servers but wants to know whether there are other practices. All the data is in HDFS.
Created 03-22-2016 09:41 PM
You can use Apache Falcon: http://hortonworks.com/hadoop/falcon/
Alternatively, see this article on Apache NiFi data flow across data centers: https://community.hortonworks.com/articles/9933/apache-nifi-aka-hdf-data-flow-across-data-center.htm...