Support Questions

Find answers, ask questions, and share your expertise

what are the steps in order to replace faulty disk in datanode

avatar

we have ambari cluster - version 2.6.1

HDP - version 2.6.4

each datanode have 12 disks with 500G size

one of the disk is faulty disk and need to replace it

what are the full steps that required , in order to replace the faulty disk

Michael-Bronson
14 REPLIES 14

avatar

@Jay , but it will Decommission all datanode ? , and we want to Decommission only the datanode with the faulty disk ( am I right here ? ) , or some other option to Decommission only the specific datanode ( worker03 )

Michael-Bronson

avatar
Master Mentor

@Michael Bronson

You can select the host which you want and has DataNode (Please use "Selected Hosts" option)

Ambari UI -->Hosts(Tab)-->"Actions"(Drop down)--> Select the Host that you want --> then click on "Selected Hosts" --> DataNode --> Decommission

.

This will ensure that the operations should be only performed to the list of host which you have selected using "

Selected Hosts" option.

avatar
Master Mentor

@Michael Bronson

Regarding your query: "the option Selected hosts isn't active ( in spite I see it )"

Here are the steps:

1. When you click on "Hosts" tab then a page opens with the list of hosts.

2. You will need to click on the "Checkbox" just in front of the hostnames mentioned in hosts page.

3. Then click on "Actions" --> "Selected Hosts" (you should see the number of hosts which you checked in step2)

4. Now you should be able to do:

Ambari UI -->Hosts(Tab)-->"Actions"(Drop down)--> Select the Host that you want --> then click on "Selected Hosts" --> DataNode --> Decommission

.

avatar

@Jay , now its clear, thanks to your great explain , when I do decommission , is it mean that datanode component can be up? , of before the decommision we need to stop anyway the componet on the datanode ?

Michael-Bronson

avatar

@Jay , after we do the decommission on some datanode , do we need also to stop the components on that datanode ? and then replaced the disk , or maybe it is inoufgh to decomission without to stop the componet and then replaced the disk ?

Michael-Bronson