<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Sqoop import - null values in HDFS files replaced by 'N' instead of '\N' in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59109#M67068</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;When I use sqoop command to import and load data from postgresql DB, even if I set the to params to:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;--null-string &lt;STRONG&gt;'\\N'&lt;/STRONG&gt; \
--null-non-string &lt;STRONG&gt;'\\N'&lt;/STRONG&gt; \&lt;/PRE&gt;&lt;P&gt;but I find that the null values in the HDFS file are replaced by &lt;U&gt;'&lt;STRONG&gt;N&lt;/STRONG&gt;' instead of '&lt;STRONG&gt;\N&lt;/STRONG&gt;'&lt;/U&gt; :&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;#sudo -u hdfs hdfs dfs -cat /user/hive/warehouse/db/categories/part-m-00000 | head -3&lt;BR /&gt;371,2017-01-10 18:43:25,Cars,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;N&lt;/STRONG&gt;
372,2017-01-10 18:43:26,Motos,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;N&lt;/STRONG&gt;
373,2017-01-16 11:01:12,Tracks,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,1&lt;/PRE&gt;&lt;P&gt;&lt;SPAN class="short_text"&gt;&lt;SPAN&gt;Which&lt;/SPAN&gt;&lt;/SPAN&gt; results to u&lt;SPAN class="short_text"&gt;&lt;SPAN&gt;nrecognized NULL values mostly for the STRING type (because for the INT type the 'N'&amp;nbsp;is considered like NULL..).&lt;BR /&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;PRE&gt;# select * from categories limit 3;&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+
| id  | date                 | name   | status | cc   | valid    |
+-----+----------------------+--------+--------+------+----------+
| 371 | 2017-01-10 18:43:25  | Cars   | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | &lt;STRONG&gt;NULL&lt;/STRONG&gt;     |
| 372 | 2017-01-10 18:43:26  | Motos  | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | &lt;STRONG&gt;NULL&lt;/STRONG&gt;     |
| 373 | 2017-01-16 11:01:12  | Tracks | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | 1        |
+-----+----------------------+--------+--------+------+----------+&lt;/PRE&gt;&lt;PRE&gt;#select count(*) from categories&amp;nbsp;where status &lt;STRONG&gt;is null&lt;/STRONG&gt;;&lt;BR /&gt;Error converting column: &lt;STRONG&gt;5 to INT&lt;/STRONG&gt;&lt;BR /&gt;Error parsing row: file: hdfs://XXX:8020/user/hive/warehouse/...., before offset: 18698&lt;BR /&gt;Error parsing row: file: hdfs://XXX:8020/user/hive/warehouse/...., before offset: 18698&lt;BR /&gt;+----------+&lt;BR /&gt;| count(*) |&lt;BR /&gt;+----------+&lt;BR /&gt;| &lt;STRONG&gt;0&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+----------+&lt;BR /&gt;Fetched 1 row(s) in 0.19s&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;But when I tested to import and load &lt;SPAN class="short_text"&gt;&lt;SPAN&gt;manually&lt;/SPAN&gt;&lt;/SPAN&gt;&amp;nbsp;with a direct hdfs put a&amp;nbsp;categories.csv file&amp;nbsp;that have a '\N' instead of NULL value I don't face any problem.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;#sudo -u hdfs hdfs dfs -put categories.csv /user/hive/warehouse/db&lt;BR /&gt;#sudo -u hdfs hdfs dfs -cat /user/hive/warehouse/db/categories/categories.csv | head -3&lt;BR /&gt;371,2017-01-10 18:43:25,Cars,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;\N&lt;/STRONG&gt;&lt;BR /&gt;372,2017-01-10 18:43:26,Motos,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;\N&lt;/STRONG&gt;&lt;STRONG&gt;&lt;BR /&gt;&lt;/STRONG&gt;373,2017-01-16 11:01:12,Tracks,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,1&lt;BR /&gt;# select * from categories limit 3;&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;| id&amp;nbsp; | date&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; | name&amp;nbsp;&amp;nbsp; | status | cc&amp;nbsp;&amp;nbsp; | valid&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;| 371 | 2017-01-10 18:43:25&amp;nbsp; | Cars&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;| 372 | 2017-01-10 18:43:26&amp;nbsp; | Motos&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;| 373 | 2017-01-16 11:01:12&amp;nbsp; | Tracks | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;#select count(*) from categories&amp;nbsp;where status &lt;STRONG&gt;is null&lt;/STRONG&gt;;&lt;BR /&gt;+----------+&lt;BR /&gt;| count(*) |&lt;BR /&gt;+----------+&lt;BR /&gt;| &lt;STRONG&gt;3&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+----------+&lt;BR /&gt;Fetched 1 row(s) in 0.20s&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for your help.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 12:07:45 GMT</pubDate>
    <dc:creator>AcharkiMed</dc:creator>
    <dc:date>2022-09-16T12:07:45Z</dc:date>
    <item>
      <title>Sqoop import - null values in HDFS files replaced by 'N' instead of '\N'</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59109#M67068</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;When I use sqoop command to import and load data from postgresql DB, even if I set the to params to:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;--null-string &lt;STRONG&gt;'\\N'&lt;/STRONG&gt; \
--null-non-string &lt;STRONG&gt;'\\N'&lt;/STRONG&gt; \&lt;/PRE&gt;&lt;P&gt;but I find that the null values in the HDFS file are replaced by &lt;U&gt;'&lt;STRONG&gt;N&lt;/STRONG&gt;' instead of '&lt;STRONG&gt;\N&lt;/STRONG&gt;'&lt;/U&gt; :&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;#sudo -u hdfs hdfs dfs -cat /user/hive/warehouse/db/categories/part-m-00000 | head -3&lt;BR /&gt;371,2017-01-10 18:43:25,Cars,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;N&lt;/STRONG&gt;
372,2017-01-10 18:43:26,Motos,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;N&lt;/STRONG&gt;
373,2017-01-16 11:01:12,Tracks,&lt;STRONG&gt;N&lt;/STRONG&gt;,CA,1&lt;/PRE&gt;&lt;P&gt;&lt;SPAN class="short_text"&gt;&lt;SPAN&gt;Which&lt;/SPAN&gt;&lt;/SPAN&gt; results to u&lt;SPAN class="short_text"&gt;&lt;SPAN&gt;nrecognized NULL values mostly for the STRING type (because for the INT type the 'N'&amp;nbsp;is considered like NULL..).&lt;BR /&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;PRE&gt;# select * from categories limit 3;&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+
| id  | date                 | name   | status | cc   | valid    |
+-----+----------------------+--------+--------+------+----------+
| 371 | 2017-01-10 18:43:25  | Cars   | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | &lt;STRONG&gt;NULL&lt;/STRONG&gt;     |
| 372 | 2017-01-10 18:43:26  | Motos  | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | &lt;STRONG&gt;NULL&lt;/STRONG&gt;     |
| 373 | 2017-01-16 11:01:12  | Tracks | &lt;STRONG&gt;N&lt;/STRONG&gt;      | CA   | 1        |
+-----+----------------------+--------+--------+------+----------+&lt;/PRE&gt;&lt;PRE&gt;#select count(*) from categories&amp;nbsp;where status &lt;STRONG&gt;is null&lt;/STRONG&gt;;&lt;BR /&gt;Error converting column: &lt;STRONG&gt;5 to INT&lt;/STRONG&gt;&lt;BR /&gt;Error parsing row: file: hdfs://XXX:8020/user/hive/warehouse/...., before offset: 18698&lt;BR /&gt;Error parsing row: file: hdfs://XXX:8020/user/hive/warehouse/...., before offset: 18698&lt;BR /&gt;+----------+&lt;BR /&gt;| count(*) |&lt;BR /&gt;+----------+&lt;BR /&gt;| &lt;STRONG&gt;0&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+----------+&lt;BR /&gt;Fetched 1 row(s) in 0.19s&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;But when I tested to import and load &lt;SPAN class="short_text"&gt;&lt;SPAN&gt;manually&lt;/SPAN&gt;&lt;/SPAN&gt;&amp;nbsp;with a direct hdfs put a&amp;nbsp;categories.csv file&amp;nbsp;that have a '\N' instead of NULL value I don't face any problem.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;#sudo -u hdfs hdfs dfs -put categories.csv /user/hive/warehouse/db&lt;BR /&gt;#sudo -u hdfs hdfs dfs -cat /user/hive/warehouse/db/categories/categories.csv | head -3&lt;BR /&gt;371,2017-01-10 18:43:25,Cars,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;\N&lt;/STRONG&gt;&lt;BR /&gt;372,2017-01-10 18:43:26,Motos,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,&lt;STRONG&gt;\N&lt;/STRONG&gt;&lt;STRONG&gt;&lt;BR /&gt;&lt;/STRONG&gt;373,2017-01-16 11:01:12,Tracks,&lt;STRONG&gt;\N&lt;/STRONG&gt;,CA,1&lt;BR /&gt;# select * from categories limit 3;&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;| id&amp;nbsp; | date&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; | name&amp;nbsp;&amp;nbsp; | status | cc&amp;nbsp;&amp;nbsp; | valid&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;| 371 | 2017-01-10 18:43:25&amp;nbsp; | Cars&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;| 372 | 2017-01-10 18:43:26&amp;nbsp; | Motos&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;| 373 | 2017-01-16 11:01:12&amp;nbsp; | Tracks | &lt;STRONG&gt;NULL&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp; | CA&amp;nbsp;&amp;nbsp; | 1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+-----+----------------------+--------+--------+------+----------+&lt;BR /&gt;#select count(*) from categories&amp;nbsp;where status &lt;STRONG&gt;is null&lt;/STRONG&gt;;&lt;BR /&gt;+----------+&lt;BR /&gt;| count(*) |&lt;BR /&gt;+----------+&lt;BR /&gt;| &lt;STRONG&gt;3&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; |&lt;BR /&gt;+----------+&lt;BR /&gt;Fetched 1 row(s) in 0.20s&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for your help.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 12:07:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59109#M67068</guid>
      <dc:creator>AcharkiMed</dc:creator>
      <dc:date>2022-09-16T12:07:45Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop import - null values in HDFS files replaced by 'N' instead of '\N'</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59161#M67069</link>
      <description>&lt;P&gt;Looks as though sqoop isn't evaluating the argument properly. I've had similar things happen to me with wonky shell evaluation; which shell are you using?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Have you tried throwing a few more backslashes in there for good measure?&lt;/P&gt;</description>
      <pubDate>Thu, 24 Aug 2017 12:13:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59161#M67069</guid>
      <dc:creator>wsch</dc:creator>
      <dc:date>2017-08-24T12:13:27Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop import - null values in HDFS files replaced by 'N' instead of '\N'</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59285#M67070</link>
      <description>&lt;P&gt;Hi&lt;BR /&gt;&lt;BR /&gt;I found the solution by adding two antislashs before the old two.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;--null-string '&lt;STRONG&gt;\\&lt;/STRONG&gt;\\N'
--null-non-string '&lt;STRONG&gt;\\&lt;/STRONG&gt;\\N'&lt;/PRE&gt;&lt;P&gt;&lt;BR /&gt;Thanks &lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/23539"&gt;@wsch&lt;/a&gt;&amp;nbsp;for your&amp;nbsp;replay.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Aug 2017 10:24:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-import-null-values-in-HDFS-files-replaced-by-N-instead/m-p/59285#M67070</guid>
      <dc:creator>AcharkiMed</dc:creator>
      <dc:date>2017-08-28T10:24:05Z</dc:date>
    </item>
  </channel>
</rss>

