Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

MergeContent in a cluster

MergeContent in a cluster

Contributor

I have a table with 12M rows coming from Netezza which I need to push it into S3. This is how I have the pipeline setup currently:

GenerateTableFetch->ExecuteSQL->ConvertRecord->MergeContent->PutS3Object

ExecuteSQL & ConvertRecord have Load Balancing turned on.

MergeContent apparently merges data within each data node. How do I combine flowfiles from all Data Nodes into one flowfile before pushing into S3?


1 REPLY 1

Re: MergeContent in a cluster

Contributor

MergeContent settings. I tried different settings but dont get quite get it to work.


109412-1560803749025.png