Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Understand the dfsclusterhealth informations

avatar
Champion Alumni

Hello,

 

The dfsclusterhealth view of the namenodes is givin me the folowing informations:

Total Files And Directories	:	64340
Configured Capacity	:	1.88 TB
DFS Used	:	2.21 TB
Non DFS Used	:	120.56 GB
DFS Remaining	:	674.13 GB
DFS Used%	:	117.57%
DFS Remaining%	:	34.96%

How can the DFS used be bigger than the configured capacity?

What is the difference?

 

Thank you,

GHERMAN Alina
1 ACCEPTED SOLUTION

avatar
Guru

I believe that's because of replication. 2.21 TB refers to the amount of space taken up by the data you've stored plus the space taken up by all the replicas of that data. 1.88 TB refers to the amount of data you can store, accounting for the fact that HDFS will need space to make replicas.

View solution in original post

1 REPLY 1

avatar
Guru

I believe that's because of replication. 2.21 TB refers to the amount of space taken up by the data you've stored plus the space taken up by all the replicas of that data. 1.88 TB refers to the amount of data you can store, accounting for the fact that HDFS will need space to make replicas.