06-12-2016 09:25 AM - edited 06-12-2016 09:31 AM
We generate over 20 million PDF files every month on one of our scanners, each named with a unique ID. I want to use Cloudera Search/Solr for faster search and to make the files available for users to download.
My questions are:
1. How do I configure Cloudera Search for HDFS? All files are located in the /pdf folder in HDFS.
2. How do I index and search for PDF files by unique ID, e.g. INVSTD05112015.pdf?
06-16-2016 07:33 AM
You will probably need to use the go-live indexer command from Cloudera:
Hope it helps you.
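On the second question (searching by unique ID), here is a rough Python sketch of the ID handling only, not of the indexing job itself. It assumes the ID is simply the file name without the .pdf extension, as in the INVSTD05112015.pdf example. The `path` field, the `pdfs` collection name, and the host URL are hypothetical placeholders, so adjust them to your own schema and cluster. The query string follows the standard Solr `/select` handler.

```python
import posixpath
from urllib.parse import urlencode


def pdf_id(hdfs_path):
    """Return the unique ID encoded in a PDF file name (name minus '.pdf')."""
    name = posixpath.basename(hdfs_path)
    stem, ext = posixpath.splitext(name)
    if ext.lower() != ".pdf" or not stem:
        raise ValueError(f"not a PDF path: {hdfs_path!r}")
    return stem


def solr_doc(hdfs_path):
    """Build a minimal document: 'id' is the lookup key, 'path' (a
    hypothetical field name) records where the file lives in HDFS."""
    return {"id": pdf_id(hdfs_path), "path": hdfs_path}


def select_url(base_url, collection, doc_id):
    """Build a Solr /select URL that looks up one document by its ID."""
    params = urlencode({"q": f'id:"{doc_id}"', "wt": "json"})
    return f"{base_url}/{collection}/select?{params}"


# Example with assumed host and collection names.
doc = solr_doc("/pdf/INVSTD05112015.pdf")
url = select_url("http://localhost:8983/solr", "pdfs", doc["id"])
```

Storing the HDFS path next to the ID means a search hit already tells the download layer which file to fetch, so the PDF itself does not need to be stored in the index.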