<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: ExcelReader Exception in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398361#M250166</link>
    <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/117154"&gt;@Mikhai&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;It's hard to say what is going on without looking at the data itself or seeing the ExcelReader configuration. I know sharing the data is not easy, but if you can replicate the issue using dummy data, please share it. It would also help to know how you configured the ExcelReader: for example, are you using a custom schema or inferring the schema?&lt;/P&gt;&lt;P&gt;I would try the following:&lt;/P&gt;&lt;P&gt;1- Try to find the table boundary in Excel and delete the empty rows. If you can't, then for the sake of testing copy the table with the rows you need into a new Excel file and see if that works.&lt;/P&gt;&lt;P&gt;2- If ExcelReader works with the 545 rows, then I would provide a custom schema (if you have not already) and set some of the fields that should always have a value to not allow null. That might help the ExcelReader avoid importing empty rows.&lt;/P&gt;&lt;P&gt;I tried to use ExcelReader before but ran into issues when the Excel file had formula columns, because of a bug in the reader itself. I'm not sure whether those issues have been addressed, but as a workaround I used the Python extension to develop a custom processor that takes Excel input and converts it into JSON using the Pandas library. This might be an option to consider if you are still having problems with the ExcelReader service, but you need NiFi 2.0 in order to use the Python extension.&lt;/P&gt;&lt;P&gt;If that helps, please accept the solution.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Tue, 03 Dec 2024 12:36:14 GMT</pubDate>
    <dc:creator>SAMSAL</dc:creator>
    <dc:date>2024-12-03T12:36:14Z</dc:date>
    <item>
      <title>ExcelReader Exception</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398328#M250161</link>
      <description>&lt;P&gt;Hi everyone,&lt;BR /&gt;I am trying to split an Excel sheet into records using ExcelReader, but I get this error:&lt;/P&gt;&lt;LI-SPOILER&gt;SplitRecord[id=01921045-9bc9-1353-b2a2-96dfb4f1e8af] Failed to split FlowFile[filename=test.xlsx]: org.apache.nifi.processor.exception.ProcessException: IOException thrown from SplitRecord[id=01921045-9bc9-1353-b2a2-96dfb4f1e8af]: java.io.IOException: org.apache.nifi.serialization.MalformedRecordException: Read next Record from Excel XLSX failed on row 546 in sheet reportdata1&lt;BR /&gt;- Caused by: java.io.IOException: org.apache.nifi.serialization.MalformedRecordException: Read next Record from Excel XLSX failed on row 546 in sheet reportdata1&lt;BR /&gt;- Caused by: org.apache.nifi.serialization.MalformedRecordException: Read next Record from Excel XLSX failed on row 546 in sheet reportdata1&lt;BR /&gt;- Caused by: java.lang.NumberFormatException: For input string: ""&lt;/LI-SPOILER&gt;&lt;P&gt;Indeed, there are 545 filled rows on the sheet, but why doesn't ExcelReader stop when it finds an empty row?&lt;/P&gt;</description>
      <pubDate>Mon, 02 Dec 2024 14:39:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398328#M250161</guid>
      <dc:creator>Mikhai</dc:creator>
      <dc:date>2024-12-02T14:39:01Z</dc:date>
    </item>
    <item>
      <title>Re: ExcelReader Exception</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398349#M250164</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/117154"&gt;@Mikhai&lt;/a&gt;,&amp;nbsp;Welcome to our community! To help you get the best possible answer, I have tagged in our NiFi experts&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/80381"&gt;@SAMSAL&lt;/a&gt;&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/35454"&gt;@MattWho&lt;/a&gt;&amp;nbsp;&amp;nbsp;who may be able to assist you further.&lt;BR /&gt;&lt;BR /&gt;Please feel free to provide any additional information or details about your query, and we hope that you will find a satisfactory solution to your question.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Dec 2024 10:38:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398349#M250164</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2024-12-03T10:38:33Z</dc:date>
    </item>
    <item>
      <title>Re: ExcelReader Exception</title>
      <link>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398361#M250166</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/117154"&gt;@Mikhai&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;It's hard to say what is going on without looking at the data itself or seeing the ExcelReader configuration. I know sharing the data is not easy, but if you can replicate the issue using dummy data, please share it. It would also help to know how you configured the ExcelReader: for example, are you using a custom schema or inferring the schema?&lt;/P&gt;&lt;P&gt;I would try the following:&lt;/P&gt;&lt;P&gt;1- Try to find the table boundary in Excel and delete the empty rows. If you can't, then for the sake of testing copy the table with the rows you need into a new Excel file and see if that works.&lt;/P&gt;&lt;P&gt;2- If ExcelReader works with the 545 rows, then I would provide a custom schema (if you have not already) and set some of the fields that should always have a value to not allow null. That might help the ExcelReader avoid importing empty rows.&lt;/P&gt;&lt;P&gt;I tried to use ExcelReader before but ran into issues when the Excel file had formula columns, because of a bug in the reader itself. I'm not sure whether those issues have been addressed, but as a workaround I used the Python extension to develop a custom processor that takes Excel input and converts it into JSON using the Pandas library. This might be an option to consider if you are still having problems with the ExcelReader service, but you need NiFi 2.0 in order to use the Python extension.&lt;/P&gt;&lt;P&gt;If that helps, please accept the solution.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
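      <content:encoded><![CDATA[
The pandas-based workaround mentioned in this reply could look roughly like the sketch below. It is an illustration only, not the custom processor the reply describes: the function names drop_blank_rows, frame_to_json, and excel_to_json are hypothetical, the NiFi Python processor wrapper is left out entirely, and it assumes pandas plus an .xlsx engine such as openpyxl are installed. It shows the conversion step only: read a sheet, discard rows in which every cell is empty, and emit the rest as a JSON array of records.

```python
import json

import pandas as pd


def drop_blank_rows(df: pd.DataFrame) -> pd.DataFrame:
    """Drop rows in which every cell is missing or a whitespace-only string."""
    # A cell counts as blank if it is NaN/None or strips down to an empty string.
    blank = df.isna() | df.astype(str).apply(lambda col: col.str.strip() == "")
    return df.loc[~blank.all(axis=1)]


def frame_to_json(df: pd.DataFrame) -> str:
    """Serialize the non-blank rows as a JSON array of record objects."""
    return drop_blank_rows(df).to_json(orient="records", date_format="iso")


def excel_to_json(path: str, sheet_name=0) -> str:
    """Read one sheet of an Excel file and return its non-blank rows as JSON.

    Assumption: reading .xlsx files requires an engine such as openpyxl.
    """
    df = pd.read_excel(path, sheet_name=sheet_name)
    return frame_to_json(df)
```

Rows that are only partly empty are kept, so a missing value in a required column still has to be handled by the schema or by a later validation step.
]]></content:encoded>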
      <pubDate>Tue, 03 Dec 2024 12:36:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/ExcelReader-Exception/m-p/398361#M250166</guid>
      <dc:creator>SAMSAL</dc:creator>
      <dc:date>2024-12-03T12:36:14Z</dc:date>
    </item>
  </channel>
</rss>

