Support Questions
Find answers, ask questions, and share your expertise
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Access child flowfiles after split content


Access child flowfiles after split content

New Contributor

I want to use SplitContent to split the PDF binary from a multipart SOAP message. I can see SplitContent generates 2 child flowfiles from data provenance. How can I access the specific child flowfile containing the PDF binary?

I tried the ExtractText with regex (%PDF-.*%%EOF) to extract the PDF binary. But after extraction, the binary contains a lot of '?' which corrupted the file. Any idea?

Don't have an account?
Coming from Hortonworks? Activate your account here