<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Split FlowFiles to Multiple ConvertJsonToSQL Processors in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219411#M79431</link>
    <description>&lt;P&gt;Thanks it works.&lt;/P&gt;</description>
    <pubDate>Tue, 12 Jun 2018 00:28:41 GMT</pubDate>
    <dc:creator>yjabbary</dc:creator>
    <dc:date>2018-06-12T00:28:41Z</dc:date>
    <item>
      <title>Split FlowFiles to Multiple ConvertJsonToSQL Processors</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219409#M79429</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have 100K flowfiles generated by a custom processor, and I need to store them in a MySQL database. To speed up the inserts, I want to process the 100K flowfiles concurrently with multiple ConvertJsonToSQL processors (four of them). Which processor should I place between the custom processor and the ConvertJsonToSQL processors to achieve that?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 12:17:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219409#M79429</guid>
      <dc:creator>yjabbary</dc:creator>
      <dc:date>2026-04-21T12:17:08Z</dc:date>
    </item>
    <item>
      <title>Re: Split FlowFiles to Multiple ConvertJsonToSQL Processors</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219410#M79430</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/83213/yjabbary.html" nodeid="83213" target="_blank"&gt;@yazeed salem&lt;/A&gt; &lt;/P&gt;&lt;UL&gt;&lt;LI&gt;If your flowfile content is already JSON and each message/record is on &lt;STRONG&gt;one line&lt;/STRONG&gt;, use the SplitText processor with Line Split Count set to &amp;lt;desired number&amp;gt;.&lt;/LI&gt;&lt;LI&gt;If your flowfile content is JSON but each message is not on one line, use the &lt;STRONG&gt;SplitRecord&lt;/STRONG&gt; processor, configure the &lt;STRONG&gt;Record Reader/Writer controller services&lt;/STRONG&gt; (define an Avro schema that matches the incoming flowfile content), and set the Records Per Split property to your desired number. The &lt;STRONG&gt;SplitRecord&lt;/STRONG&gt; processor is the more &lt;STRONG&gt;efficient&lt;/STRONG&gt; choice because it works on chunks of data.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Refer to &lt;A href="https://community.hortonworks.com/articles/189642/update-the-contents-of-flowfile-by-using-updaterec.html" rel="nofollow noopener noreferrer" target="_blank"&gt;this&lt;/A&gt; and &lt;A href="https://community.hortonworks.com/articles/115311/convert-csv-to-json-avro-xml-using-convertrecord-p.html" rel="nofollow noopener noreferrer" target="_blank"&gt;this&lt;/A&gt; article to configure the Record Reader/Writer controller services.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Flow:&lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;1. Custom processor&lt;BR /&gt;2. SplitRecord/SplitText processor&lt;BR /&gt;3. DistributeLoad&lt;BR /&gt;4. ConvertJsonToSQL&lt;/PRE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="77638-flow.png" style="width: 1012px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/15987iED64683A22A43CD6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="77638-flow.png" alt="77638-flow.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;DistributeLoad Configs:&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="77639-dl.png" style="width: 705px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/15988i64DF57D97B2DDA9F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="77639-dl.png" alt="77639-dl.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TH&gt;Name&lt;/TH&gt;&lt;TH&gt;Default Value&lt;/TH&gt;&lt;TH&gt;Allowable Values&lt;/TH&gt;&lt;TH&gt;Description&lt;/TH&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;STRONG&gt;Number of Relationships&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;Determines the number of Relationships to which the load should be distributed&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;STRONG&gt;Distribution Strategy&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;round robin&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;round robin&lt;/LI&gt;&lt;LI&gt;next available&lt;/LI&gt;&lt;LI&gt;load distribution service&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;Determines how the load will be distributed. If using Round Robin, will not distribute any FlowFiles unless all destinations can accept FlowFiles; when using Next Available, will distribute FlowFiles as long as at least 1 destination can accept FlowFiles.&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;
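&lt;P&gt;To make the round robin strategy concrete, here is a minimal sketch in plain Python. It is an illustration only, not NiFi code: the function name round_robin and the sample data are assumptions made for this example, and it ignores back pressure (the real processor waits until all destinations can accept FlowFiles).&lt;/P&gt;
&lt;PRE&gt;
```python
from itertools import cycle


def round_robin(flowfiles, relationships):
    """Assign each FlowFile to a relationship in strict rotation.

    Illustration only: a plain-Python model of the round robin
    strategy, not the DistributeLoad implementation.
    """
    routed = {name: [] for name in relationships}
    # cycle() repeats the relationship names endlessly; zip() stops
    # as soon as the FlowFiles run out.
    for flowfile, name in zip(flowfiles, cycle(relationships)):
        routed[name].append(flowfile)
    return routed


# Nine hypothetical FlowFiles (numbered 0..8) over three relationships.
routed = round_robin(range(9), ["1", "2", "3"])
print({name: len(items) for name, items in routed.items()})
```
&lt;/PRE&gt;
&lt;P&gt;Each relationship ends up with an equal share, which is why one downstream ConvertJsonToSQL processor per relationship spreads the work evenly.&lt;/P&gt;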
&lt;P&gt;Because I configured &lt;STRONG&gt;Number of Relationships to 3&lt;/STRONG&gt;, I connected:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;relationship 1 from the DistributeLoad processor to the first ConvertJsonToSQL processor&lt;/LI&gt;&lt;LI&gt;relationship 2 to the second ConvertJsonToSQL processor&lt;/LI&gt;&lt;LI&gt;relationship 3 to the third ConvertJsonToSQL processor&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Set Number of Relationships in the DistributeLoad processor to the number of parallel branches you want, and add a ConvertJsonToSQL processor for each one.&lt;/P&gt;&lt;P&gt;In addition, &lt;STRONG&gt;please consider using the record-oriented PutDatabaseRecord&lt;/STRONG&gt; processor, which works on chunks of data. Configure the Record Reader controller service to read the incoming flowfile; with that in place, I think you won't need to split the records at all.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;Flow:&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;1. Custom processor&lt;BR /&gt;2. PutDatabaseRecord&lt;/PRE&gt;&lt;P&gt;If this answer addressed your question, &lt;STRONG&gt;click the Accept button below to accept it.&lt;/STRONG&gt; That helps other community users find solutions to this kind of issue quickly.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 02:39:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219410#M79430</guid>
      <dc:creator>Shu_ashu</dc:creator>
      <dc:date>2019-08-18T02:39:58Z</dc:date>
    </item>
    <item>
      <title>Re: Split FlowFiles to Multiple ConvertJsonToSQL Processors</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219411#M79431</link>
      <description>&lt;P&gt;Thanks it works.&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jun 2018 00:28:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Split-FlowFiles-to-Multiple-ConvertJsonToSQL-Processors/m-p/219411#M79431</guid>
      <dc:creator>yjabbary</dc:creator>
      <dc:date>2018-06-12T00:28:41Z</dc:date>
    </item>
  </channel>
</rss>

