Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

adding additional volume to data nodes.

avatar
Contributor

We are running hadoop HA cluster using AWS EC2 instances with 17 Data ndoes (All instances are M4.4xlarge including name nodes). All the DN's are configured with 16TB (EBS st1) volumes for hdfs.

Now we are running out of HDFS storage and looking to extend the storage. Since 16TB is max limit for st1 EBS we cannot extend the existing volume.

Trying to add additional 16TB volumes to few data nodes and update "DataNode directories" in ambari with this new volume path.

Will this approach impact any performance issue with cluster ? Any other things need be considered in this approach ?

1 ACCEPTED SOLUTION

avatar
Contributor

@Sajesh PP - As you increase the size of a data node you can run into performance problems such as to much read/write activity on a single data node. If this occurs it is better to add new data nodes with additional storage.

View solution in original post

1 REPLY 1

avatar
Contributor

@Sajesh PP - As you increase the size of a data node you can run into performance problems such as to much read/write activity on a single data node. If this occurs it is better to add new data nodes with additional storage.