<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: ConvertExcelToCSVProcessor - File too Large in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/ConvertExcelToCSVProcessor-File-too-Large/m-p/364850#M239252</link>
    <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/92365"&gt;@TRSS_Cloudera&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;The issue you have described links to this known issue reported in Apache NiFi"&lt;BR /&gt;&lt;A href="https://issues.apache.org/jira/browse/NIFI-10792" target="_blank"&gt;https://issues.apache.org/jira/browse/NIFI-10792&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;The discussion found in the comments of this jira point to a couple workarounds which includes the negatives of each.&lt;BR /&gt;&lt;BR /&gt;From that discussion it appears the best approach is development of a new "Excel Record Reader" controller service that could be used by the existing ConvertRecord processor and CSVRecordSetWriter.&lt;BR /&gt;This is outlined in following jira:&lt;BR /&gt;&lt;A href="https://issues.apache.org/jira/browse/NIFI-11167" target="_blank"&gt;https://issues.apache.org/jira/browse/NIFI-11167&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="batang,apple gothic"&gt;If you found that the provided solution(s) assisted you with your query, please take a moment to login and click&lt;/FONT&gt;&amp;nbsp;&lt;FONT face="arial black,avant garde" color="#FF0000"&gt;Accept as Solution&amp;nbsp;&lt;/FONT&gt;&lt;FONT face="batang,apple gothic" color="#000000"&gt;below each response that helped.&lt;BR /&gt;&lt;BR /&gt;Thank you,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="batang,apple gothic" color="#000000"&gt;Matt&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 28 Feb 2023 19:28:52 GMT</pubDate>
    <dc:creator>MattWho</dc:creator>
    <dc:date>2023-02-28T19:28:52Z</dc:date>
    <item>
      <title>ConvertExcelToCSVProcessor - File too Large</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ConvertExcelToCSVProcessor-File-too-Large/m-p/364838#M239249</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a workflow that is picking up an Excel file, that contains 3 sheets, and is attempting to run it through a ConvertExcelToCSVProcessor, but it is failing with the error below:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="2" color="#FF0000"&gt;&lt;SPAN&gt;&lt;SPAN class="ui-provider vx b c d e f g h i j k l m n o p q r s t u v w x y z ab ac ae af ag ah ai aj ak"&gt;Failed to process incoming Excel document. Tried to allocate an array of length 328,219,733, but the maximum length for this record type is 100,000,000. If the file is not corrupt or large, please open an issue on bugzilla to request increasing the maximum allowable size for this record type. As a temporary workaround, consider setting a higher override value with IOUtils.setByteArrayMaxOverride(): org.apache.poi.util.RecordFormatException: Tried to allocate an array of length 328,219,733, but the maximum length for this record type is 100,000,000. If the file is not corrupt or large, please open an issue on bugzilla to request increasing the maximum allowable size for this record type. As a temporary workaround, consider setting a higher override value with IOUtils.setByteArrayMaxOverride()&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Has anyone else run into this error and been able to get around the issue? I'm not seeing where I could set a new value for&amp;nbsp;&lt;SPAN class="ui-provider vx b c d e f g h i j k l m n o p q r s t u v w x y z ab ac ae af ag ah ai aj ak"&gt;IOUtils.setByteArrayMaxOverride(). Other option I am considering is a Python script to perform this task, but that would add a great deal of more complexity to my flow.&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class="ui-provider vx b c d e f g h i j k l m n o p q r s t u v w x y z ab ac ae af ag ah ai aj ak"&gt;Thanks for any help!&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 28 Feb 2023 15:39:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ConvertExcelToCSVProcessor-File-too-Large/m-p/364838#M239249</guid>
      <dc:creator>TRSS_Cloudera</dc:creator>
      <dc:date>2023-02-28T15:39:07Z</dc:date>
    </item>
    <item>
      <title>Re: ConvertExcelToCSVProcessor - File too Large</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ConvertExcelToCSVProcessor-File-too-Large/m-p/364850#M239252</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/92365"&gt;@TRSS_Cloudera&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;The issue you have described links to this known issue reported in Apache NiFi"&lt;BR /&gt;&lt;A href="https://issues.apache.org/jira/browse/NIFI-10792" target="_blank"&gt;https://issues.apache.org/jira/browse/NIFI-10792&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;The discussion found in the comments of this jira point to a couple workarounds which includes the negatives of each.&lt;BR /&gt;&lt;BR /&gt;From that discussion it appears the best approach is development of a new "Excel Record Reader" controller service that could be used by the existing ConvertRecord processor and CSVRecordSetWriter.&lt;BR /&gt;This is outlined in following jira:&lt;BR /&gt;&lt;A href="https://issues.apache.org/jira/browse/NIFI-11167" target="_blank"&gt;https://issues.apache.org/jira/browse/NIFI-11167&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="batang,apple gothic"&gt;If you found that the provided solution(s) assisted you with your query, please take a moment to login and click&lt;/FONT&gt;&amp;nbsp;&lt;FONT face="arial black,avant garde" color="#FF0000"&gt;Accept as Solution&amp;nbsp;&lt;/FONT&gt;&lt;FONT face="batang,apple gothic" color="#000000"&gt;below each response that helped.&lt;BR /&gt;&lt;BR /&gt;Thank you,&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="batang,apple gothic" color="#000000"&gt;Matt&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 28 Feb 2023 19:28:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ConvertExcelToCSVProcessor-File-too-Large/m-p/364850#M239252</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2023-02-28T19:28:52Z</dc:date>
    </item>
  </channel>
</rss>

