Reply
Contributor
Posts: 56
Registered: ‎02-12-2015

PDF index and search | Solr| Cloudera Search

[ Edited ]

Hi Team,

 

We have generated over 20 milion pdf files every month on one of our scanner with unique id name. I want to use cloudera search/solr for faster search and make it available for download for user.

Following are my questions,

 

1. How to configure cloudera search for HDFS. Here all files are located /pdf folder in HDFS.

 

2 How to index and search for pdf files with unique id. e.g. ( INVSTD05112015.pdf)

 

 

 

 

 

Highlighted
Explorer
Posts: 12
Registered: ‎05-26-2016

Re: PDF index and search | Solr| Cloudera Search

Hi Neelesh,

 

You will probably need to use the go-live indexer command from cloudera:

 

http://www.cloudera.com/documentation/enterprise/5-2-x/topics/search_batch_index_to_solr_servers_usi...

 

Hope it helps you.