Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Difference between spaceConsumed vs length parameters in getContentSummary using webHDFS REST API

avatar
Expert Contributor

According to the following doc, spaceConsumed refers to the "The disk space consumed by the content." and length refers to "The number of bytes used by the content."

https://hadoop.apache.org/docs/r1.0.4/webhdfs.html#ContentSummary

What would be the real difference here in terms of space taken from HDFS?

Consider that most of time, if not always, spaceConsumed has a lot larger value then length. What does spaceConsumed consider that length does not?

1 ACCEPTED SOLUTION

avatar
Expert Contributor

spaceConsumed = length * replicationFactor

View solution in original post

1 REPLY 1

avatar
Expert Contributor

spaceConsumed = length * replicationFactor