- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Mix cloud and on premise Data nodes
- Labels:
-
Apache YARN
Created on ‎09-21-2017 10:18 AM - edited ‎09-16-2022 08:46 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi, is it supported to have some Worker/Data nodes on premise and others in the AWS or Azure cloud within the same cluster managed by the same NN/YARN? What are the issues with this?
Created ‎09-25-2017 09:42 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
ebeb,
Cloudera Director is designed for provisioning cloud clusters and cannot create a mixed cloud/on-premise cluster.
My initial thoughts on a mixed cluster is that network latency between the cloud nodes and on-premise nodes would lead to poor performance. You may be better served by creating two separate clusters and using some other mechanism for sharing data. For example, you can export/import data to S3.
Created ‎09-25-2017 09:42 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
ebeb,
Cloudera Director is designed for provisioning cloud clusters and cannot create a mixed cloud/on-premise cluster.
My initial thoughts on a mixed cluster is that network latency between the cloud nodes and on-premise nodes would lead to poor performance. You may be better served by creating two separate clusters and using some other mechanism for sharing data. For example, you can export/import data to S3.
