Is there any way to take a output of hdfs disk utilization for last 1 week in HDP 2.1. Means I want to take output of my hdfs disk utilization for last one week from command.
I have following component installed on cluster
Something like similar ambari reports as part of the alerting. You can find the alert with name:
1. HDFS Storage Capacity Usage (Daily)
2. HDFS Storage Capacity Usage (Weekly)
However you may need to check the logic for this alert throeshold calculation.
Also those alerts just watch the growth rates.
Also at the HDFS level you may need to explore the MBean:
and query: "CapacityUsedGB" and "CapacityTotalGB" on your own on a desired interval to see the growth.
$ curl -s http://erie1.example.com:50070/jmx?qry=Hadoop:service=NameNode,name=FSNamesystem | grep "Capacity*" "CapacityTotal" : 333359300608, "CapacityTotalGB" : 310.0, "CapacityUsed" : 10163200102, "CapacityUsedGB" : 9.0, "CapacityRemaining" : 191658755293, "CapacityRemainingGB" : 178.0, "CapacityUsedNonDFS" : 131537345213, "BlockCapacity" : 2097152,
However I do not see any standard in-built utility in HDFS which can give you the exact detail (like how much data in a day your HDFS is utilizing and total how much in a week).
I guess you will need to calculate it using the attributes provided by the "Hadoop:service=NameNode,name=FSNamesystem" OR using the su - hdfs -c "hdfs dfsadmin -report" report calculation.