How does Hadoop store unstructured data such as images, audio, and video in a distributed environment?

Expert Contributor


I am new to the Hadoop world. I want to understand how Hadoop stores unstructured data such as audio, video, and images across multiple nodes, and how to process this unstructured data.

Thank you.


Super Mentor

@heta desai

Copying snippet from:

It is absolutely possible without doing anything extra. Hadoop lets you read and write binary files, so practically anything that can be converted into bytes (images, videos, etc.) can be stored in HDFS. For this, Hadoop provides SequenceFiles. A SequenceFile is a flat file consisting of binary key/value pairs. SequenceFile provides Writer, Reader, and Sorter classes for writing, reading, and sorting, respectively. So you could convert your image or video file into a SequenceFile and store it in HDFS.
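To make the "flat file of binary key/value pairs" idea concrete, here is a minimal conceptual sketch that uses only `java.io`. It is not the Hadoop SequenceFile format and does not call the Hadoop API. It assumes a simple length-prefixed layout in which the key is a file name and the value is the file's raw bytes. The class name `BinaryRecordSketch` and the methods `pack` and `unpack` are hypothetical names chosen for illustration. In a real job you would use Hadoop's own `SequenceFile` Writer and Reader classes mentioned above.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

/** Conceptual sketch only: an assumed layout, NOT the real SequenceFile format. */
public class BinaryRecordSketch {

    /** Packs (file name -> raw bytes) pairs into one flat, length-prefixed byte array. */
    public static byte[] pack(Map<String, byte[]> files) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            for (Map.Entry<String, byte[]> entry : files.entrySet()) {
                byte[] key = entry.getKey().getBytes(StandardCharsets.UTF_8);
                byte[] value = entry.getValue();
                out.writeInt(key.length);   // key length prefix
                out.write(key);             // key bytes (the file name)
                out.writeInt(value.length); // value length prefix
                out.write(value);           // value bytes (image, audio, video, ...)
            }
        }
        return buffer.toByteArray();
    }

    /** Reads the records back in the order they were written. */
    public static Map<String, byte[]> unpack(byte[] packed) throws IOException {
        Map<String, byte[]> files = new LinkedHashMap<>();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(packed))) {
            while (in.available() > 0) {
                byte[] key = new byte[in.readInt()];
                in.readFully(key);
                byte[] value = new byte[in.readInt()];
                in.readFully(value);
                files.put(new String(key, StandardCharsets.UTF_8), value);
            }
        }
        return files;
    }

    public static void main(String[] args) throws IOException {
        Map<String, byte[]> files = new LinkedHashMap<>();
        // Placeholder bytes standing in for real media content.
        files.put("cat.jpg", new byte[] {(byte) 0xFF, (byte) 0xD8, (byte) 0xFF});
        files.put("clip.mp4", new byte[] {0, 0, 0, 24});

        byte[] packed = pack(files);
        Map<String, byte[]> restored = unpack(packed);
        System.out.println(restored.keySet() + " restored from " + packed.length + " bytes");
    }
}
```

The point of the sketch is that many small binary files become records in one container, and each record is still retrievable by its key. The real SequenceFile adds more on top of this idea, so treat the layout here as an illustration only.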


Some Examples: