- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Dedicated edge nodes
Created ‎08-03-2016 09:17 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi
We are thinking on having dedicated edge per project for our data lake. Each project will have a vm on which we install the required clients.
Anyone is doing this ? Any problems or issues that we should be aware of with this configuration ?
Created ‎08-03-2016 09:44 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
As you already know edge nodes are for running your client processes. They are not running your cluster processes and usually not storing data, unless you are using edge node data ingestion and staging your data in edge node. So edge node configuration can be customized quite a bit based on your needs.
I have not seen customers having separate edge nodes for each project but I don't see anything particularly wrong except that it increases the number of ways your cluster can be accessed which means increasing chances of security holes.
One main consideration, however will be to make sure you have good network and bandwidth support between your cluster and all of the edge nodes.
Other than that, provided reasonable resources (like CPU, disk specially if you are staging data for ingest and memory), this should be fine.
I would also recommend reading the accepted answer on this thread for more details to help you make decision.
https://community.hortonworks.com/questions/34872/staging-on-edge-nodes.html
Created ‎08-03-2016 09:44 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
As you already know edge nodes are for running your client processes. They are not running your cluster processes and usually not storing data, unless you are using edge node data ingestion and staging your data in edge node. So edge node configuration can be customized quite a bit based on your needs.
I have not seen customers having separate edge nodes for each project but I don't see anything particularly wrong except that it increases the number of ways your cluster can be accessed which means increasing chances of security holes.
One main consideration, however will be to make sure you have good network and bandwidth support between your cluster and all of the edge nodes.
Other than that, provided reasonable resources (like CPU, disk specially if you are staging data for ingest and memory), this should be fine.
I would also recommend reading the accepted answer on this thread for more details to help you make decision.
https://community.hortonworks.com/questions/34872/staging-on-edge-nodes.html
