<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Looking for something like GenerateTableFetch in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Looking-for-something-like-GenerateTableFetch/m-p/309690#M223905</link>
    <description>&lt;P&gt;Figured out an alternative way. I developed a Oracle PL/SQL function which takes table name as an argument, and produces a series of queries like "SELECT * FROM T1 OFFSET x ROWS FETCH NEXT 10000 ROWS ONLY". The number of queries is based on the number of rows of the table, which is a statistics number in the catalog table. If the table has 1M rows, and I want to have 100k rows in each batch, it will produces 10 queries. I use ExecuteSQLRecord to call this function, which effectively does the job of NiFi processor GenerateTableFetch. My next processor (e.g. ExecuteSQLRecord again) can now have 10 concurrent tasks working in parallel.&lt;/P&gt;</description>
    <pubDate>Thu, 14 Jan 2021 20:53:25 GMT</pubDate>
    <dc:creator>Garyy</dc:creator>
    <dc:date>2021-01-14T20:53:25Z</dc:date>
    <item>
      <title>Looking for something like GenerateTableFetch</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Looking-for-something-like-GenerateTableFetch/m-p/309634#M223884</link>
      <description>&lt;P&gt;I use&amp;nbsp;&lt;SPAN&gt;ExecuteSQLRecord to run a query and write to CSV format. The table has 10M rows. Although I can split the output into multiple flow files, the query is executed by only a single thread and is very slow. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Is there a way to partition the query into multiple queries so that the next processor can run multiple concurrent tasks, each one process one partition? It would be like:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="lia-indent-padding-left-60px"&gt;&lt;SPAN&gt;GenerateTableFetch -&amp;gt; ExecuteSQLRecord (with concurrent tasks)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The problem is that GenerateTableFetch only accepts table name as input. It does not accept customized queries.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Please advise if you have solutions. I am new to NiFi. So I would like your details. Thank you in advance.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jan 2021 23:30:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Looking-for-something-like-GenerateTableFetch/m-p/309634#M223884</guid>
      <dc:creator>Garyy</dc:creator>
      <dc:date>2021-01-13T23:30:27Z</dc:date>
    </item>
    <item>
      <title>Re: Looking for something like GenerateTableFetch</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Looking-for-something-like-GenerateTableFetch/m-p/309690#M223905</link>
      <description>&lt;P&gt;Figured out an alternative way. I developed a Oracle PL/SQL function which takes table name as an argument, and produces a series of queries like "SELECT * FROM T1 OFFSET x ROWS FETCH NEXT 10000 ROWS ONLY". The number of queries is based on the number of rows of the table, which is a statistics number in the catalog table. If the table has 1M rows, and I want to have 100k rows in each batch, it will produces 10 queries. I use ExecuteSQLRecord to call this function, which effectively does the job of NiFi processor GenerateTableFetch. My next processor (e.g. ExecuteSQLRecord again) can now have 10 concurrent tasks working in parallel.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jan 2021 20:53:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Looking-for-something-like-GenerateTableFetch/m-p/309690#M223905</guid>
      <dc:creator>Garyy</dc:creator>
      <dc:date>2021-01-14T20:53:25Z</dc:date>
    </item>
  </channel>
</rss>

