<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Can I create Primary Key in Hive table? I saw in TBLPROPERTIES you can mention &amp;quot;PRIMARY KEY&amp;quot;=&amp;quot;col_name&amp;quot;, what it actually does? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121081#M22478</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/2993/abdusmollah.html" nodeid="2993"&gt;@Abdus Sagir Mollah&lt;/A&gt;&lt;P&gt;Primary keys can also be useful for bucketing (i.e. paritioning of data) especially if you are trying to leverage the ACID capabilities of Hive. &lt;/P&gt;&lt;P&gt;Quote from the below blog: &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;&lt;EM&gt;Once an hour, a set of inserts and updates (up to 500k rows) for various dimension tables (eg. customer, inventory, stores) needs to be processed. The dimension tables have primary keys and are typically bucketed and sorted on those keys.&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Entire blog: &lt;A href="http://hortonworks.com/blog/adding-acid-to-apache-hive/" target="_blank"&gt;http://hortonworks.com/blog/adding-acid-to-apache-hive/&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 10 Mar 2016 23:08:06 GMT</pubDate>
    <dc:creator>awatson</dc:creator>
    <dc:date>2016-03-10T23:08:06Z</dc:date>
    <item>
      <title>Can I create Primary Key in Hive table? I saw in TBLPROPERTIES you can mention "PRIMARY KEY"="col_name", what it actually does?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121079#M22476</link>
      <description />
      <pubDate>Thu, 10 Mar 2016 20:30:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121079#M22476</guid>
      <dc:creator>abdus_mollah</dc:creator>
      <dc:date>2016-03-10T20:30:40Z</dc:date>
    </item>
    <item>
      <title>Re: Can I create Primary Key in Hive table? I saw in TBLPROPERTIES you can mention "PRIMARY KEY"="col_name", what it actually does?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121080#M22477</link>
      <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/2993/abdusmollah.html" nodeid="2993"&gt;@Abdus Sagir Mollah&lt;/A&gt; the designation of primary key is simply metadata describing the column. It doesn't apply any referential constraints. &lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2016 21:12:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121080#M22477</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2016-03-10T21:12:23Z</dc:date>
    </item>
    <item>
      <title>Re: Can I create Primary Key in Hive table? I saw in TBLPROPERTIES you can mention "PRIMARY KEY"="col_name", what it actually does?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121081#M22478</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2993/abdusmollah.html" nodeid="2993"&gt;@Abdus Sagir Mollah&lt;/A&gt;&lt;P&gt;Primary keys can also be useful for bucketing (i.e. paritioning of data) especially if you are trying to leverage the ACID capabilities of Hive. &lt;/P&gt;&lt;P&gt;Quote from the below blog: &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;&lt;EM&gt;Once an hour, a set of inserts and updates (up to 500k rows) for various dimension tables (eg. customer, inventory, stores) needs to be processed. The dimension tables have primary keys and are typically bucketed and sorted on those keys.&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Entire blog: &lt;A href="http://hortonworks.com/blog/adding-acid-to-apache-hive/" target="_blank"&gt;http://hortonworks.com/blog/adding-acid-to-apache-hive/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2016 23:08:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121081#M22478</guid>
      <dc:creator>awatson</dc:creator>
      <dc:date>2016-03-10T23:08:06Z</dc:date>
    </item>
    <item>
      <title>Re: Can I create Primary Key in Hive table? I saw in TBLPROPERTIES you can mention "PRIMARY KEY"="col_name", what it actually does?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121082#M22479</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/200/awatson.html" nodeid="200"&gt;@Andrew Watson&lt;/A&gt; - The ACID properties have been taken back by the community. It is not recommended for customer use currently.&lt;/P&gt;</description>
      <pubDate>Thu, 10 Mar 2016 23:26:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Can-I-create-Primary-Key-in-Hive-table-I-saw-in/m-p/121082#M22479</guid>
      <dc:creator>sdutta</dc:creator>
      <dc:date>2016-03-10T23:26:40Z</dc:date>
    </item>
  </channel>
</rss>

