Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

File distribution on HDFS

avatar
Rising Star

Hi dear expert!

i'm wondering is there any way to check file distribution amond nodes in HDFS?

 

some way that allow to check on which nodes place some particular file of dirrectory?

 

thanks!

1 ACCEPTED SOLUTION

avatar
Mentor
If by 'hard to analyse' you mean to parse/process it, you can consider
using the Java API to fetch block location info too:
http://archive.cloudera.com/cdh5/cdh/5/hadoop/api/org/apache/hadoop/fs/FileSystem.html#getFileBlockL...

View solution in original post

3 REPLIES 3

avatar
Mentor
You can check the file on NN Web UI's File Browser (shows all block IDs and
their locations), or run 'hdfs fsck /path/to/file -files -blocks -locations'

avatar
Rising Star
Thanks! Hdfs fsck will works, but it's hard to analyze in case of big file. maybe it is other way to get aggregate values?

avatar
Mentor
If by 'hard to analyse' you mean to parse/process it, you can consider
using the Java API to fetch block location info too:
http://archive.cloudera.com/cdh5/cdh/5/hadoop/api/org/apache/hadoop/fs/FileSystem.html#getFileBlockL...