<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How HCatalog is different from Hive? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-HCatalog-is-different-from-Hive/m-p/215292#M177202</link>
    <description>&lt;P&gt;Let's start with &lt;EM&gt;Hive&lt;/EM&gt; and then &lt;EM&gt;HCatalog.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Hive&lt;/STRONG&gt; &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;Layer for analyzing, querying and managing large datasets that reside in Hadoop various file systems&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;⇢ uses &lt;EM&gt;HiveQL&lt;/EM&gt; (HQL) as processing engine&lt;/P&gt;&lt;P&gt;⇢ uses &lt;EM&gt;SerDes&lt;/EM&gt; for serialization and deserialization&lt;/P&gt;&lt;P&gt;⇢ works best with huge volumes of data&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;HCatalog&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;Table and storage management layer for Hadoop&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;⇢ basically, the &lt;EM&gt;EDW&lt;/EM&gt; system for Hadoop (it supports several file formats such as &lt;EM&gt;RCFile&lt;/EM&gt;, &lt;EM&gt;CSV&lt;/EM&gt;, &lt;EM&gt;JSON&lt;/EM&gt;, &lt;EM&gt;SequenceFile&lt;/EM&gt;, &lt;EM&gt;ORC)&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;⇢ is a sub-component of Hive, which enables ETL processes&lt;/P&gt;&lt;P&gt;⇢ tool for accessing metadata that reside in &lt;EM&gt;Hive Metastore&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;⇢ acts as an API to expose the metastore as REST interface to external tools such as Pig&lt;/P&gt;&lt;P&gt;⇢ uses &lt;EM&gt;WebHcat&lt;/EM&gt;, a web server for engaging with the &lt;EM&gt;Hive Metastore&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;I think the focus has to be made on how they complement each other rather than focusing on their differences.&lt;/P&gt;&lt;P&gt;Documentation (3)&lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;This &lt;A href="https://community.hortonworks.com/questions/23797/how-the-hive-metastore-works.html"&gt;answer&lt;/A&gt; from &lt;A rel="user" href="https://community.cloudera.com/users/186/sshaw.html" nodeid="186"&gt;@Scott Shaw&lt;/A&gt; is worth checking&lt;/LI&gt;&lt;LI&gt;This &lt;A href="https://fr.slideshare.net/markgrover/introduction-to-hive-and-hcatalog"&gt;slideshare&lt;/A&gt; presents the use cases and features of &lt;EM&gt;Hive&lt;/EM&gt; and &lt;EM&gt;Hcatalog&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;This direct &lt;A href="https://www.ibm.com/support/knowledgecenter/en/SSKKZM_1.3.0/fuwucon_forward_hadoop.htm"&gt;graph&lt;/A&gt; from IBM shows how they use both layers in a &lt;EM&gt;batch job&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I hope this helps! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 10 Dec 2018 09:58:52 GMT</pubDate>
    <dc:creator>zaki</dc:creator>
    <dc:date>2018-12-10T09:58:52Z</dc:date>
  </channel>
</rss>

