The customer wants to setup a backup to a backup cluster. They want to know how many NameNodes are required on the Backup Cluster. They want to know if there is some rule of thumb. The constraints for the backup cluster is below
Backup window: 1am-5am ET (12am-4am CT)
Want to have a yarn job that runs Falcon backup jobs in another queue when the cluster isn’t busy - that way it doesn't pile up for the night.
The clusters will be in Amazon, so they want the backup cluster to be backed by Amazon S3 storage (being used as HDFS)