I run HDP 2.6 and i need clarifications on the disk usage. When i run "hdfs dfs -dh -h" command on the hdfs directory, it gives a ridiculous size of 28.1T but when i drill down to each day and sum it all up, it's just over 8TB of data.
Why is there a huge difference?
28.1 T /in/feed/type
98.8 G /in/feed/type/day=20180701
112 G /in/feed/type/day=20180702
104.4 G /in/feed/type/day=20180703