<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Hive creating huge Temp files in HDFS in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137609#M31844</link>
    <description>&lt;P&gt;Hi All&lt;/P&gt;&lt;P&gt;Hive is creating GB size files in /tmp we are facing size issue because of this.&lt;/P&gt;&lt;P&gt;15.3 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000074_0 &lt;/P&gt;&lt;P&gt;15.3 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000075_0 &lt;/P&gt;&lt;P&gt;15.2 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000076_0 &lt;/P&gt;&lt;P&gt;15.2 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000077_0 &lt;/P&gt;&lt;P&gt;15.4 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000078_0&lt;/P&gt;&lt;P&gt;Any help is appreciated .&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
    <pubDate>Tue, 14 Jun 2016 18:58:14 GMT</pubDate>
    <dc:creator>shihab_pri</dc:creator>
    <dc:date>2016-06-14T18:58:14Z</dc:date>
    <item>
      <title>Hive creating huge Temp files in HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137609#M31844</link>
      <description>&lt;P&gt;Hi All&lt;/P&gt;&lt;P&gt;Hive is creating GB size files in /tmp we are facing size issue because of this.&lt;/P&gt;&lt;P&gt;15.3 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000074_0 &lt;/P&gt;&lt;P&gt;15.3 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000075_0 &lt;/P&gt;&lt;P&gt;15.2 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000076_0 &lt;/P&gt;&lt;P&gt;15.2 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000077_0 &lt;/P&gt;&lt;P&gt;15.4 G  /tmp/hive/hive/e9f9943b-8b35-466a-9d61-17e8a86339f1/hive_2016-06-09_19-00-01_169_2126725244382661354-1/-mr-10001/.hive-staging_hive_2016-06-09_19-00-01_169_2126725244382661354-1/-ext-10002/000078_0&lt;/P&gt;&lt;P&gt;Any help is appreciated .&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jun 2016 18:58:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137609#M31844</guid>
      <dc:creator>shihab_pri</dc:creator>
      <dc:date>2016-06-14T18:58:14Z</dc:date>
    </item>
    <item>
      <title>Re: Hive creating huge Temp files in HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137610#M31845</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/10731/shihabpriv.html" nodeid="10731"&gt;@Shihab&lt;/A&gt;&lt;P&gt;The temp tables are created during the application run as intermediate data. These intermediate tables will not be removed in case the application fails and cleanup does not happen.&lt;/P&gt;&lt;P&gt;Please check if applications are running which is generating data. Meanwhile, you can also try compressing the intermediate data by setting the property "hive.exec.compress.intermediate" as true in hive-site.xml.&lt;/P&gt;&lt;P&gt;The related compression codec and other options are determined from Hadoop configuration variables mapred.output.compress*.&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;&lt;P&gt;Thanks and Regards,&lt;/P&gt;&lt;P&gt;Sindhu&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jun 2016 19:07:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137610#M31845</guid>
      <dc:creator>ssubhas</dc:creator>
      <dc:date>2016-06-14T19:07:38Z</dc:date>
    </item>
    <item>
      <title>Re: Hive creating huge Temp files in HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137611#M31846</link>
      <description>&lt;P&gt;Thanks for the fast response. &lt;/P&gt;</description>
      <pubDate>Tue, 14 Jun 2016 19:22:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137611#M31846</guid>
      <dc:creator>shihab_pri</dc:creator>
      <dc:date>2016-06-14T19:22:43Z</dc:date>
    </item>
    <item>
      <title>Re: Hive creating huge Temp files in HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137612#M31847</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10731/shihabpriv.html" nodeid="10731"&gt;@Shihab&lt;/A&gt;  Hive uses temporary Directory structures both on the node where Hive client  is running and the default HDFS instance.&lt;/P&gt;&lt;P&gt;These folders are used to store temp/imtermediary data for each query(as separate files)- gets cleaned up by hive client after a while(configurable) after successful execution of query , But sometimes gets pooled up on client abnormal termination.&lt;/P&gt;&lt;P&gt;One such configurable parameter on HDFS storage is hive.exec.scratchdir (generally set to /tmp/hive)&lt;/P&gt;&lt;P&gt;When writing data to a Hive table/partition, Hive will first write to a temporary location (ie hive.exec.scratchdir) and then move the data to the target table. (The storage could be your underlying filesystem .. could be HDFS (normal case) or S3)&lt;/P&gt;&lt;P&gt;Work around is to clean these directory structure through a cron Job periodically (when size exceeds)&lt;/P&gt;</description>
      <pubDate>Wed, 15 Jun 2016 20:11:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-creating-huge-Temp-files-in-HDFS/m-p/137612#M31847</guid>
      <dc:creator>dchiguruvad</dc:creator>
      <dc:date>2016-06-15T20:11:01Z</dc:date>
    </item>
  </channel>
</rss>

