Hi,
I am researching non-commercial solutions for converting a scanned PDF document into a searchable PDF using Tesseract OCR. I am new to the Hortonworks world and community, and any suggestions would be much appreciated.
Thanks!