<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive table load in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99346#M12594</link>
    <description>&lt;P&gt;If you do an "INSERT OVERWRITE" then all the files in the table's LOCATION will be deleted and replaced with the new data.&lt;/P&gt;</description>
    <pubDate>Thu, 17 Dec 2015 01:08:50 GMT</pubDate>
    <dc:creator>rich1</dc:creator>
    <dc:date>2015-12-17T01:08:50Z</dc:date>
    <item>
      <title>Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99341#M12589</link>
      <description>&lt;P&gt;Hello Friends,&lt;/P&gt;&lt;P&gt;I am new to Hadoop and Hive. Created a simple table with one column as ID and loaded data into this table from a file that's located in local filesyste with 6 (one with NULL)  records with command as "load data local inpath '/home/edureka/Desktop/data' into table  emp;"&lt;/P&gt;&lt;P&gt;Did select and it's show 5 records. Later manually changed the source file and removed all those 5 records and added 5 new records and loaded these new records without using the OVERWRITE with command  "load data local inpath '/home/edureka/Desktop/data' into table  emp;" Data load was successful.&lt;/P&gt;&lt;P&gt;This time if I do select * then I am getting 18 records. 1st set is repeating twice. I don't know why it's showing like this. Am I missing any command ? pls help me to understand.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Pls refer the screenshot.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/839-capture.png"&gt;capture.png&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Dec 2015 03:25:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99341#M12589</guid>
      <dc:creator>SatishS</dc:creator>
      <dc:date>2015-12-16T03:25:38Z</dc:date>
    </item>
    <item>
      <title>Re: Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99342#M12590</link>
      <description>&lt;P&gt;I think you answered your own question: you did not use OVERWRITE on the second "load" command, so you added the records twice. If you wanted to start over w/ all new data in the table, run the load command with OVERWRITE.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Dec 2015 10:00:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99342#M12590</guid>
      <dc:creator>rich1</dc:creator>
      <dc:date>2015-12-16T10:00:38Z</dc:date>
    </item>
    <item>
      <title>Re: Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99343#M12591</link>
      <description>&lt;P&gt;Have you expected to end up with only 12 records instead of 18?&lt;/P&gt;&lt;P&gt;If '/home/edureka/Desktop/data' is a directory and your file called d1 I suspect that after changing it file d1~ was created. So the second time both files were loaded into the table causing 18 records.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Dec 2015 16:45:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99343#M12591</guid>
      <dc:creator>pminovic</dc:creator>
      <dc:date>2015-12-16T16:45:26Z</dc:date>
    </item>
    <item>
      <title>Re: Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99344#M12592</link>
      <description>&lt;P&gt;Thanks. What should I use if I want to override all data (I mean fresh load)?&lt;/P&gt;</description>
      <pubDate>Thu, 17 Dec 2015 00:41:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99344#M12592</guid>
      <dc:creator>SatishS</dc:creator>
      <dc:date>2015-12-17T00:41:18Z</dc:date>
    </item>
    <item>
      <title>Re: Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99345#M12593</link>
      <description>&lt;P&gt;No, I have single file in data directory. First time I had created with 5 records and loaded this file into table and came back to same file, deleted all 5 records and entered 5 new records and reloaded again. I was missing the OVERWRITE keyword in the query. now it's fine. Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Dec 2015 00:43:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99345#M12593</guid>
      <dc:creator>SatishS</dc:creator>
      <dc:date>2015-12-17T00:43:26Z</dc:date>
    </item>
    <item>
      <title>Re: Hive table load</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99346#M12594</link>
      <description>&lt;P&gt;If you do an "INSERT OVERWRITE" then all the files in the table's LOCATION will be deleted and replaced with the new data.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Dec 2015 01:08:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-table-load/m-p/99346#M12594</guid>
      <dc:creator>rich1</dc:creator>
      <dc:date>2015-12-17T01:08:50Z</dc:date>
    </item>
  </channel>
</rss>

