<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Sqoop Import  --split-by with sql function in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Sqoop-Import-split-by-with-sql-function/m-p/188938#M151031</link>
    <description>&lt;P&gt;No the Boundary query does work, but i figured that even though we do split by using a cast on a varchar column, once the split is identified sqoop internally does a sort of the column on split by which is a varchar according to Mysql and it brings in Duplicate on the target system.&lt;/P&gt;&lt;P&gt;I found unless you use a integer column or do an incremental load (after load to staging) will not solve the issues of duplicate on target with Varchar datatype with your split-by column&lt;/P&gt;</description>
    <pubDate>Tue, 10 Jul 2018 01:22:28 GMT</pubDate>
    <dc:creator>elkrish</dc:creator>
    <dc:date>2018-07-10T01:22:28Z</dc:date>
  </channel>
</rss>

