<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question incremental load from mysql to hdfs in hadoop in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/incremental-load-from-mysql-to-hdfs-in-hadoop/m-p/163478#M57397</link>
    <description>&lt;P&gt;From mysql to hdfs directory.&lt;/P&gt;&lt;PRE&gt;
sqoop import --connect jdbc:mysql://localhost/hadoopdb --username smas --password MyNewPass --table emp1 -m 1 --target-dir /data_new7 --incremental append --check-column id -last-value 2

&lt;/PRE&gt;&lt;P&gt;i have /date_new7/part-m-00000 also it didnot work ?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13761-1.png" style="width: 912px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20510iF429E7A397FD2CC4/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13761-1.png" alt="13761-1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13762-2.png" style="width: 1247px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20511i63C28E80ACD73C8C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13762-2.png" alt="13762-2.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13763-3.png" style="width: 1246px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20512iBA143017CFD9E86D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13763-3.png" alt="13763-3.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;how to make sure that part-m-00000 is updated with 3rd row or id .&lt;/P&gt;&lt;P&gt;it is updating as a seperate table ? any suggestion ?&lt;/P&gt;</description>
    <pubDate>Sun, 18 Aug 2019 11:34:16 GMT</pubDate>
    <dc:creator>mithleshdb8</dc:creator>
    <dc:date>2019-08-18T11:34:16Z</dc:date>
    <item>
      <title>incremental load from mysql to hdfs in hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/incremental-load-from-mysql-to-hdfs-in-hadoop/m-p/163478#M57397</link>
      <description>&lt;P&gt;From mysql to hdfs directory.&lt;/P&gt;&lt;PRE&gt;
sqoop import --connect jdbc:mysql://localhost/hadoopdb --username smas --password MyNewPass --table emp1 -m 1 --target-dir /data_new7 --incremental append --check-column id -last-value 2

&lt;/PRE&gt;&lt;P&gt;i have /date_new7/part-m-00000 also it didnot work ?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13761-1.png" style="width: 912px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20510iF429E7A397FD2CC4/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13761-1.png" alt="13761-1.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13762-2.png" style="width: 1247px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20511i63C28E80ACD73C8C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13762-2.png" alt="13762-2.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13763-3.png" style="width: 1246px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20512iBA143017CFD9E86D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13763-3.png" alt="13763-3.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;how to make sure that part-m-00000 is updated with 3rd row or id .&lt;/P&gt;&lt;P&gt;it is updating as a seperate table ? any suggestion ?&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 11:34:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/incremental-load-from-mysql-to-hdfs-in-hadoop/m-p/163478#M57397</guid>
      <dc:creator>mithleshdb8</dc:creator>
      <dc:date>2019-08-18T11:34:16Z</dc:date>
    </item>
    <item>
      <title>Re: incremental load from mysql to hdfs in hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/incremental-load-from-mysql-to-hdfs-in-hadoop/m-p/163479#M57398</link>
      <description>&lt;P&gt;It worked. part-m-00001 is not a separate table, it's just another file in your import directory. If you create an external table on /date_new7, Hive will see a single table with 3 rows. Ditto for Map-reduce jobs taking /date_new7 as their input. If you end up with many small files you can merge them into one (from time to time) by using for example hadoop-streaming, see &lt;A href="https://hadoop.apache.org/docs/r2.7.2/hadoop-streaming/HadoopStreaming.html#Specifying_the_Number_of_Reducers"&gt;this example &lt;/A&gt;and set "mapreduce.job.reduces=1".&lt;/P&gt;</description>
      <pubDate>Sat, 18 Mar 2017 15:22:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/incremental-load-from-mysql-to-hdfs-in-hadoop/m-p/163479#M57398</guid>
      <dc:creator>pminovic</dc:creator>
      <dc:date>2017-03-18T15:22:37Z</dc:date>
    </item>
  </channel>
</rss>

