02-24-2014 12:53 PM
Suppose I have more than one Hadoop cluster. Is there any way to run a MapReduce job (or Hive query) across the multiple clusters?
I might have more than one HDFS cluster for various administrative or data-organization reasons, but I want to run a job that scans all of the data. Perhaps there would be a small Hadoop cluster that acts as a front end to the other (larger) clusters.
Has anyone heard of this? Does it make sense?
03-07-2014 06:21 PM