What are the available cluster types in Data Engineering Cluster?
Currently, Cloudera provides clusters that can run the following Data Engineering job types:
- Hive on MapReduce
- Hive on Spark
- Spark on YARN
Note: Job submission for a cluster is restricted to the type of job for that various cluster. For example, if you wanted to run just a MR2 job on a Spark-on-YARN cluster, you cannot submit this, even though Spark is configured to use YARN. You would need to standup a separate YARN cluster for it to accept a MR2/YARN job.