How does Hadoop store unstructured data such as images, audio, and video in a distributed environment?

Expert Contributor

Hello,

I am new to the Hadoop world. I want to understand how Hadoop stores unstructured data such as audio, video, and images across multiple nodes, and how to process this unstructured data.

Thank you


Master Mentor

@heta desai

Copying a snippet from: http://stackoverflow.com/questions/16546040/store-images-videos-into-hadoop-hdfs

It is absolutely possible without doing anything extra. Hadoop lets you read and write binary files, so practically anything that can be converted into bytes (images, videos, etc.) can be stored in HDFS. To do that, Hadoop provides SequenceFiles. A SequenceFile is a flat file consisting of binary key/value pairs. SequenceFile provides Writer, Reader, and Sorter classes for writing, reading, and sorting respectively. So you could convert your image or video file into a SequenceFile and store it in HDFS.
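To make the "binary key/value pairs" idea concrete, here is a rough sketch in plain Python. It is only an illustration of the concept: it does not use Hadoop and does not produce the real SequenceFile on-disk format (which also has a header, sync markers, and optional compression). The function names `write_records` and `read_records` and the length-prefixed layout are assumptions made up for this example. The key is the file name and the value is the raw bytes of the image or video.

```python
import io
import struct

# Hypothetical record layout for illustration only (NOT the real SequenceFile format):
#   4-byte key length | 4-byte value length | key bytes | value bytes
HEADER = ">II"  # two big-endian unsigned 32-bit integers


def write_records(stream, records):
    """Append (key, value) pairs to a binary stream as length-prefixed records."""
    for key, value in records:
        key_bytes = key.encode("utf-8")
        stream.write(struct.pack(HEADER, len(key_bytes), len(value)))
        stream.write(key_bytes)
        stream.write(value)


def read_records(stream):
    """Yield (key, value) pairs from the stream until it is exhausted."""
    header_size = struct.calcsize(HEADER)
    while True:
        header = stream.read(header_size)
        if not header:
            return  # clean end of stream
        if len(header) < header_size:
            raise ValueError("truncated record header")
        key_len, value_len = struct.unpack(HEADER, header)
        key = stream.read(key_len).decode("utf-8")
        value = stream.read(value_len)
        yield key, value


if __name__ == "__main__":
    # Stand-in byte strings; in practice these would come from open(path, "rb").read().
    media = [("cat.jpg", b"\xff\xd8\xff\xe0 fake jpeg"), ("clip.mp4", b"\x00\x00\x00\x18 fake mp4")]
    buffer = io.BytesIO()
    write_records(buffer, media)
    buffer.seek(0)
    for name, payload in read_records(buffer):
        print(name, len(payload), "bytes")
```

In Hadoop itself, the corresponding step would typically use `SequenceFile.Writer` with a `Text` key (the file name) and a `BytesWritable` value (the file contents). Packing many small media files into one such container is a common reason to do this rather than storing each file separately.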


Some Examples:

http://www.tothenew.com/blog/how-to-manage-and-analyze-video-data-using-hadoop/

https://content.pivotal.io/blog/using-hadoop-mapreduce-for-distributed-video-transcoding
