Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Search and find documents from HDFS.

Search and find documents from HDFS.

Expert Contributor

Hello All , 

 

I have some documents in HDFS . How can i just search and find the document . I am not bothered about contents of the document . 

 

For ex : If i search for science , I must get the documents having science  in its name ( Not in its contents ).

 

Any kind of help ll be very appreciated :) 

 

Thanks

Bala

Thanks
Bala
4 REPLIES 4

Re: Search and find documents from HDFS.

Expert Contributor
I come across HDFSFindTool . I tried search using it . Its Working fine but ll it be fast enough to handle large amount of data
Thanks
Bala

Re: Search and find documents from HDFS.

Expert Contributor
No reply ???? :(
Thanks
Bala

Re: Search and find documents from HDFS.

Contributor

Since you're asking in Cloudere Search, why not use the HDFSFindTool to get all files and feed those line by line to Solr/Search? You then have a nice index you can search through, with Hue Search or your custom interface build on top of that.

 

HDFSFindTool support getting files since time-X so you can run this process daily/hourly etc.

Highlighted

Re: Search and find documents from HDFS.

Explorer
How do you go about using this? Would you have a tutorial anywhere?