Created on 07-21-2016 10:13 PM - edited 09-16-2022 01:35 AM
Using the GetHTTP processor, we grab random images from DigitalOcean's free Unsplash.it image site. I give each image a random file name so it can be saved uniquely in HDFS.
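Outside of NiFi, the same idea (fetch an image, then store it under a unique name) can be sketched in a few lines of Python. This is only an illustration, not the flow's actual configuration: the URL, the function names, and the use of a UUID for the file name are all assumptions, and the sketch writes to a local directory rather than HDFS.

```python
import urllib.request
import uuid
from pathlib import Path

# Assumed endpoint for illustration; the exact URL configured in GetHTTP is not given here.
IMAGE_URL = "https://unsplash.it/200/300/?random"


def random_image_name(extension: str = "jpg") -> str:
    """Return a collision-resistant file name built from a random UUID."""
    return f"{uuid.uuid4().hex}.{extension}"


def fetch_random_image(target_dir: str, url: str = IMAGE_URL, timeout: float = 10.0) -> Path:
    """Download one image and save it under a unique name in target_dir."""
    directory = Path(target_dir)
    directory.mkdir(parents=True, exist_ok=True)
    destination = directory / random_image_name()
    # Read the whole response body and write it to the uniquely named file.
    with urllib.request.urlopen(url, timeout=timeout) as response:
        destination.write_bytes(response.read())
    return destination
```

Calling `fetch_random_image("images")` would save one image into a local `images` directory and return its path. Because every name comes from a fresh UUID, repeated downloads do not overwrite each other, which is the property the random file name provides in HDFS.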
The entire data flow, from GetHTTP to the final HDFS storage of the image and its metadata as JSON.