<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Is compression used for Hive temporary tables? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117613#M26298</link>
    <description>&lt;P&gt;What do you mean with memory? As far as I know a temporary table is just like any other table with the one exception that it will be cleaned up when the session ends. So you can choose any storage format but it will be HDFS. So it depends. If you only need it once I would agree ORC is most likely not good but if you create a temp tables once and then query it a couple of times ORC definitely makes sense to me .&lt;/P&gt;&lt;P&gt;Edit: Interesting You could use the HDFS storage policies here. Do you have a cluster that has been setup like this? You could still use any kind of storage you want compressed or not and I still think that ORC will be good if you use your temporary table a couple times. &lt;/P&gt;&lt;PRE&gt;Starting in &lt;A href="https://issues.apache.org/jira/browse/HIVE-7313"&gt;Hive 1.1.0&lt;/A&gt; the storage policy for temporary tables can be set to &lt;CODE&gt;memory&lt;/CODE&gt;, &lt;CODE&gt;ssd&lt;/CODE&gt;, or &lt;CODE&gt;default&lt;/CODE&gt; with the &lt;A href="https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.exec.temporary.table.storage"&gt;hive.exec.temporary.table.storage&lt;/A&gt; configuration parameter (see &lt;A href="http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html#Storage_Types_and_Storage_Policies"&gt;HDFS Storage Types and Storage Policies&lt;/A&gt;).&lt;/PRE&gt;</description>
    <pubDate>Wed, 27 Apr 2016 21:49:49 GMT</pubDate>
    <dc:creator>bleonhardi</dc:creator>
    <dc:date>2016-04-27T21:49:49Z</dc:date>
    <item>
      <title>Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117609#M26294</link>
      <description>&lt;P&gt;Assuming compression is enabled of course. &lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 05:06:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117609#M26294</guid>
      <dc:creator>TerryP</dc:creator>
      <dc:date>2016-04-27T05:06:51Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117610#M26295</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/38/terry.html" nodeid="38"&gt;@Terry Padgett&lt;/A&gt; These are stored as uncompressed text files. &lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 05:44:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117610#M26295</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-04-27T05:44:42Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117611#M26296</link>
      <description>&lt;P&gt;Which temporary tables are we talking about?&lt;/P&gt;&lt;P&gt;Tables you create with CREATE TEMPORARY TABLE?&lt;/P&gt;&lt;P&gt;These can have any storage format you want. So you you create it as ORC it definitely WILL be compressed. &lt;/P&gt;&lt;P&gt;Or what do you mean with "compression is enabled" ? &lt;/P&gt;&lt;P&gt;&lt;A href="https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Create/Drop/TruncateTable" target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Create/Drop/TruncateTable&lt;/A&gt;&lt;/P&gt;&lt;P&gt;There are also some internal structures for example the dataset that is generated by the Tez job before Hiveserver2 returns it to the client. This can be text or sequence file ( configurable ) but I heard there is a jira to use ORC for it instead. &lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 20:38:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117611#M26296</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-04-27T20:38:12Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117612#M26297</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; Yes, these are Hive temporary tables. The feature is new'ish and I wanted to know if there are any surprises not mentioned in the language manual. Memory is one of the options for temporary table storage and I want to see if it is possible to fit the tables into memory. The tables are short-lived so I don't think ORC is a realistic choice at the moment but that could change.&lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 21:18:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117612#M26297</guid>
      <dc:creator>TerryP</dc:creator>
      <dc:date>2016-04-27T21:18:21Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117613#M26298</link>
      <description>&lt;P&gt;What do you mean with memory? As far as I know a temporary table is just like any other table with the one exception that it will be cleaned up when the session ends. So you can choose any storage format but it will be HDFS. So it depends. If you only need it once I would agree ORC is most likely not good but if you create a temp tables once and then query it a couple of times ORC definitely makes sense to me .&lt;/P&gt;&lt;P&gt;Edit: Interesting You could use the HDFS storage policies here. Do you have a cluster that has been setup like this? You could still use any kind of storage you want compressed or not and I still think that ORC will be good if you use your temporary table a couple times. &lt;/P&gt;&lt;PRE&gt;Starting in &lt;A href="https://issues.apache.org/jira/browse/HIVE-7313"&gt;Hive 1.1.0&lt;/A&gt; the storage policy for temporary tables can be set to &lt;CODE&gt;memory&lt;/CODE&gt;, &lt;CODE&gt;ssd&lt;/CODE&gt;, or &lt;CODE&gt;default&lt;/CODE&gt; with the &lt;A href="https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.exec.temporary.table.storage"&gt;hive.exec.temporary.table.storage&lt;/A&gt; configuration parameter (see &lt;A href="http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html#Storage_Types_and_Storage_Policies"&gt;HDFS Storage Types and Storage Policies&lt;/A&gt;).&lt;/PRE&gt;</description>
      <pubDate>Wed, 27 Apr 2016 21:49:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117613#M26298</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-04-27T21:49:49Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117614#M26299</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/38/terry.html" nodeid="38"&gt;@Terry Padgett&lt;/A&gt;&lt;P&gt;If you want to store these temporary tables as ORC, it is still possible. Here is an example. &lt;/P&gt;&lt;PRE&gt;create temporary table tp1 stored as orcfile as select count(*) from table_params;&lt;/PRE&gt;&lt;P&gt;My earlier answer was whether the text format which is default is compressed on hdfs. &lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 22:14:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117614#M26299</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-04-27T22:14:35Z</dc:date>
    </item>
    <item>
      <title>Re: Is compression used for Hive temporary tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117615#M26300</link>
      <description>&lt;P&gt;Ya &lt;A rel="user" href="https://community.cloudera.com/users/216/ravi.html" nodeid="216"&gt;@Ravi Mutyala&lt;/A&gt; , the temporary tables are only in use for a few minutes. My concern is also about any additional time being spent when writing the table as ORC. Probably have to run a bake off to see how it works in this case. &lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2016 23:31:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-compression-used-for-Hive-temporary-tables/m-p/117615#M26300</guid>
      <dc:creator>TerryP</dc:creator>
      <dc:date>2016-04-27T23:31:18Z</dc:date>
    </item>
  </channel>
</rss>

