HDFS replication factor and minimum number of data nodes in the HDFS cluster
- Labels: Ambari Blueprints
Created on 07-06-2020 04:53 AM - edited 07-06-2020 04:55 AM
We have a Hadoop cluster with only 2 data node machines.
In the HDFS configuration we set Block replication to 3, so:
Block replication=3
Is it OK to set Block replication=3 when we have only two data nodes in the cluster?
My understanding is that with Block replication=3 and only 2 data node machines in the HDFS cluster,
one machine should hold 2 replicas and the other machine 1 replica. Am I correct here?
Created 07-14-2020 12:27 PM
@mike_bronson7 The default replication factor is 3, so it is recommended to have a minimum of 3 data nodes in the cluster to accommodate 3 healthy replicas of each block. HDFS will not write more than one replica of the same block to the same data node, so neither machine will hold 2 replicas. In this scenario each block gets 2 healthy replicas, one on each of the 2 available data nodes, and the blocks will be reported as under-replicated.
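To make the arithmetic concrete, here is a minimal sketch in Python. It is a toy model and not HDFS code: the function `place_replicas` and the node names `dn1` and `dn2` are hypothetical, and the only rule it encodes is the one described above, that a data node holds at most one replica of a given block.

```python
from typing import Dict, List


def place_replicas(replication_factor: int, datanodes: List[str]) -> Dict[str, object]:
    """Toy model of replica placement for a single block (not real HDFS logic)."""
    # Assumption: at most one replica of a block per data node, so the number
    # of replicas placed can never exceed the number of available nodes.
    placed_on = datanodes[:replication_factor]
    missing = replication_factor - len(placed_on)
    return {
        "placed_on": placed_on,
        "missing_replicas": missing,
        "under_replicated": missing > 0,
    }


# Replication factor 3 on a cluster with only 2 data nodes.
print(place_replicas(3, ["dn1", "dn2"]))
# {'placed_on': ['dn1', 'dn2'], 'missing_replicas': 1, 'under_replicated': True}
```

With a third node in the list, `missing_replicas` drops to 0 and the block is no longer under-replicated, which is why 3 data nodes is the recommended minimum for a replication factor of 3.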