
How does Hadoop store unstructured data such as images, audio, and video in a distributed environment?

Expert Contributor

Hello,

I am new to the Hadoop world. I want to understand how Hadoop stores unstructured data such as audio, video, and images across multiple nodes, and how to process this unstructured data.

Thank you

1 REPLY

Master Mentor

@heta desai

Copying a snippet from: http://stackoverflow.com/questions/16546040/store-images-videos-into-hadoop-hdfs

It is absolutely possible without doing anything extra. Hadoop can read and write binary files, so practically anything that can be converted into bytes (images, videos, etc.) can be stored in HDFS. For this purpose Hadoop provides SequenceFiles. A SequenceFile is a flat file consisting of binary key/value pairs, and it provides Writer, Reader, and Sorter classes for writing, reading, and sorting respectively. So you could convert your image or video files into a SequenceFile and store it in HDFS.
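To make the "flat file of binary key/value pairs" idea concrete, here is a minimal sketch in plain Java. It is not the real SequenceFile on-disk format and does not use the Hadoop API; the class name `KeyValuePacker` and its `pack`/`unpack` methods are hypothetical, chosen only for illustration. It packs several small binary files (file name as key, raw bytes as value) into one length-prefixed byte stream and reads them back. In an actual Hadoop job you would use `SequenceFile.Writer` and `SequenceFile.Reader` instead, commonly with `Text` keys and `BytesWritable` values.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration only; NOT part of Hadoop and NOT the
// real SequenceFile format. It only mimics the key/value record idea.
class KeyValuePacker {

    // Packs each (file name, file bytes) pair as:
    // [key length][key bytes][value length][value bytes]
    public static byte[] pack(Map<String, byte[]> files) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            for (Map.Entry<String, byte[]> entry : files.entrySet()) {
                byte[] key = entry.getKey().getBytes(StandardCharsets.UTF_8);
                byte[] value = entry.getValue();
                out.writeInt(key.length);
                out.write(key);
                out.writeInt(value.length);
                out.write(value);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    // Reads the records back in the order they were written.
    public static Map<String, byte[]> unpack(byte[] packed) {
        Map<String, byte[]> files = new LinkedHashMap<>();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(packed))) {
            while (in.available() > 0) {
                byte[] key = new byte[in.readInt()];
                in.readFully(key);
                byte[] value = new byte[in.readInt()];
                in.readFully(value);
                files.put(new String(key, StandardCharsets.UTF_8), value);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return files;
    }

    public static void main(String[] args) {
        Map<String, byte[]> files = new LinkedHashMap<>();
        files.put("cat.jpg", new byte[] {(byte) 0xFF, (byte) 0xD8, 0x01}); // stand-in bytes
        files.put("clip.mp4", new byte[] {0x00, 0x00, 0x00, 0x18});        // stand-in bytes
        byte[] packed = pack(files);
        System.out.println("records read back: " + unpack(packed).size());
    }
}
```

The point of packing many images or clips into one container file is that a single large file is easier for HDFS to handle than a very large number of tiny ones, and each record can still be handed to a processing task by key.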


Some Examples:

http://www.tothenew.com/blog/how-to-manage-and-analyze-video-data-using-hadoop/

https://content.pivotal.io/blog/using-hadoop-mapreduce-for-distributed-video-transcoding
