Just wondering on the processing techniques available for typical unstructured data with the Hadoop ecosystem. For example, is there any processing framework which supports processing images, audio, video etc?
For example, if its just extracting the metadata, Tika / Lucene can be used. However, if I have to process the image file to look for some object / process CCTV footage to look for any suspicious entities, how to do with the data stored in HDFS?
you can implement a mapreduce job with OpenCV library. @Greenhorn Techie
@Artem Ervits Thanks for your response. I believe to make use of OpenCV, we need to use Hadoop Streaming API? Alternatively, JavaCV might be usable. Overall, I think image processing can be handled better than more complex types like audio and video.
Is there any similar capability for audio and video?