<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: &amp;quot;Error parsing row: file&amp;quot; Table consists of multiple csv files in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/quot-Error-parsing-row-file-quot-Table-consists-of-multiple/m-p/287896#M213309</link>
    <description>&lt;P&gt;The error "&lt;SPAN&gt;Error converting column: 35 to TIMESTAMP" means there was an error when converting column 35 to the TIMESTAMP type.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The error "Error parsing row: file: hdfs://blabla/foo_042019.csv, before offset: 2432696320" means there was an error while parsing the row at file offset&amp;nbsp;2432696320, in the file&amp;nbsp;foo_042019.csv.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;So it looks like there are several rows in your dataset where certain fields cannot be converted to TIMESTAMPs. You should be able to open up the file, and seek to the specified offset to find the rows that are corrupted.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I believe, Hive does not throw an exception when given the same dataset, instead it converts the corrupted rows to NULL. The same behavior can be emulated in Impala by setting 'abort_on_error=false'. However, be warned that setting this option can mask data corruption issues. See&amp;nbsp;&lt;A href="https://impala.apache.org/docs/build/html/topics/impala_abort_on_error.html" target="_blank"&gt;https://impala.apache.org/docs/build/html/topics/impala_abort_on_error.html&lt;/A&gt;&amp;nbsp;for details.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 17 Jan 2020 23:48:00 GMT</pubDate>
    <dc:creator>SahilTakiar</dc:creator>
    <dc:date>2020-01-17T23:48:00Z</dc:date>
  </channel>
</rss>

