PDF index and search | Solr| Cloudera Search

Hi Team,

 

We generate over 20 million PDF files every month from one of our scanners, each named with a unique ID. I want to use Cloudera Search/Solr so that users can search for these files quickly and download them.

My questions are:

 

1. How do I configure Cloudera Search for HDFS? All the files are located in the /pdf folder in HDFS.

 

2. How do I index and search PDF files by their unique ID, e.g. INVSTD05112015.pdf?

1 REPLY

Re: PDF index and search | Solr| Cloudera Search

Hi Neelesh,

 

You will probably need to use the batch indexer (MapReduceIndexerTool) with the --go-live option from Cloudera Search:

 

http://www.cloudera.com/documentation/enterprise/5-2-x/topics/search_batch_index_to_solr_servers_usi...
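
As a rough sketch only (not taken from the linked page), this is how such a run and a later lookup by file ID might be put together in Python. The jar path, morphline file name, ZooKeeper address, collection name "pdfs", output directory and Solr host are all placeholders I made up, and the lookup assumes your morphline stores the file name without ".pdf" in the Solr "id" field, so adjust everything to your cluster.

```python
import shlex
from urllib.parse import urlencode


def build_golive_command(jar, morphline_file, output_dir, zk_host,
                         collection, input_dir):
    """Return the argv list for a MapReduceIndexerTool run with --go-live."""
    return [
        "hadoop", "jar", jar,
        "org.apache.solr.hadoop.MapReduceIndexerTool",
        "--morphline-file", morphline_file,  # morphline that parses the PDFs
        "--output-dir", output_dir,          # scratch dir for the built index shards
        "--zk-host", zk_host,                # ZooKeeper ensemble used by SolrCloud
        "--collection", collection,          # target Solr collection
        "--go-live",                         # merge the shards into the live servers
        input_dir,                           # HDFS directory holding the PDFs
    ]


def build_id_query_url(solr_base, collection, file_id):
    """Return a Solr select URL that looks up one document by its id field."""
    params = urlencode({"q": 'id:"%s"' % file_id, "wt": "json"})
    return "%s/%s/select?%s" % (solr_base.rstrip("/"), collection, params)


if __name__ == "__main__":
    # Every value below is a hypothetical placeholder, not a real setting.
    cmd = build_golive_command(
        jar="/path/to/search-mr-job.jar",
        morphline_file="morphline.conf",
        output_dir="hdfs:///tmp/pdf-index-out",
        zk_host="zk01.example.com:2181/solr",
        collection="pdfs",
        input_dir="hdfs:///pdf",
    )
    # Print a shell-safe command line to review before running it by hand.
    print(" ".join(shlex.quote(arg) for arg in cmd))
    print(build_id_query_url("http://solr01.example.com:8983/solr",
                             "pdfs", "INVSTD05112015"))
```

The script only prints the command and the query URL, so you can check them before running anything against the cluster.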

 

Hope it helps you.