Hello All ,
I have some documents in HDFS . How can i just search and find the document . I am not bothered about contents of the document .
For ex : If i search for science , I must get the documents having science in its name ( Not in its contents ).
Any kind of help ll be very appreciated :)
Since you're asking in Cloudere Search, why not use the HDFSFindTool to get all files and feed those line by line to Solr/Search? You then have a nice index you can search through, with Hue Search or your custom interface build on top of that.
HDFSFindTool support getting files since time-X so you can run this process daily/hourly etc.