We have generated over 20 milion pdf files every month on one of our scanner with unique id name. I want to use cloudera search/solr for faster search and make it available for download for user.
Following are my questions,
1. How to configure cloudera search for HDFS. Here all files are located /pdf folder in HDFS.
2 How to index and search for pdf files with unique id. e.g. ( INVSTD05112015.pdf)
You will probably need to use the go-live indexer command from cloudera:
Hope it helps you.