Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

adding additional volume to data nodes.

Solved Go to solution
Highlighted

adding additional volume to data nodes.

New Contributor

We are running hadoop HA cluster using AWS EC2 instances with 17 Data ndoes (All instances are M4.4xlarge including name nodes). All the DN's are configured with 16TB (EBS st1) volumes for hdfs.

Now we are running out of HDFS storage and looking to extend the storage. Since 16TB is max limit for st1 EBS we cannot extend the existing volume.

Trying to add additional 16TB volumes to few data nodes and update "DataNode directories" in ambari with this new volume path.

Will this approach impact any performance issue with cluster ? Any other things need be considered in this approach ?

1 ACCEPTED SOLUTION

Accepted Solutions

Re: adding additional volume to data nodes.

Cloudera Employee

@Sajesh PP - As you increase the size of a data node you can run into performance problems such as to much read/write activity on a single data node. If this occurs it is better to add new data nodes with additional storage.

1 REPLY 1

Re: adding additional volume to data nodes.

Cloudera Employee

@Sajesh PP - As you increase the size of a data node you can run into performance problems such as to much read/write activity on a single data node. If this occurs it is better to add new data nodes with additional storage.