Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

how to develop map reduce pdf processing java code for hadoop 2.6.0 ?

how to develop map reduce pdf processing java code for hadoop 2.6.0 ?

New Contributor
 
1 REPLY 1

Re: how to develop map reduce pdf processing java code for hadoop 2.6.0 ?

Expert Contributor

If you are looking for extracting text from PDF, I had done this via Apache Tika. Its simpler to use.

Don't have an account?
Coming from Hortonworks? Activate your account here