<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Data Compression Doesn't work in ORC with SNAPPY Compression in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172153#M57960</link>
    <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/222/deepesh.html" nodeid="222"&gt;@Deepesh&lt;/A&gt;. You are right default compression is ZLIB and that causes the difference in compression. &lt;/P&gt;</description>
    <pubDate>Fri, 24 Mar 2017 00:01:05 GMT</pubDate>
    <dc:creator>balavignesh_nag</dc:creator>
    <dc:date>2017-03-24T00:01:05Z</dc:date>
    <item>
      <title>Data Compression Doesn't work in ORC with SNAPPY Compression</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172151#M57958</link>
      <description>&lt;P&gt;I have a hive managed partition table (4 partitions) which has 2TB of data and it is stored as ORC tables with no compression. Now I have created a duplicate table with ORC -- SNAPPY compression and inserted the data from old table into the duplicate table. I noticed that it took more loading time than usual I believe that's because of enabling the compression. Then i have checked the file size in duplicate table with snappy compression and it shows somewhere around 2.6TB. Verified the count of both the tables and it remains the same. Any idea why the difference in size even after enabling the snappy compression in ORC? &lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 19:58:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172151#M57958</guid>
      <dc:creator>balavignesh_nag</dc:creator>
      <dc:date>2017-03-23T19:58:56Z</dc:date>
    </item>
    <item>
      <title>Re: Data Compression Doesn't work in ORC with SNAPPY Compression</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172152#M57959</link>
      <description>&lt;P&gt;Are you sure that the ORC tables you created were with no compression. By default hive.exec.orc.default.compress is set to ZLIB, perhaps your original table is with zlib compression.&lt;/P&gt;&lt;P&gt;There are some interesting threads to read:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/4067/snappy-vs-zlib-pros-and-cons-for-each-compression.html" target="_blank"&gt;https://community.hortonworks.com/questions/4067/snappy-vs-zlib-pros-and-cons-for-each-compression.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/articles/49252/performance-comparison-bw-orc-snappy-and-zlib-in-h.html" target="_blank"&gt;https://community.hortonworks.com/articles/49252/performance-comparison-bw-orc-snappy-and-zlib-in-h.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 23:06:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172152#M57959</guid>
      <dc:creator>deepesh1</dc:creator>
      <dc:date>2017-03-23T23:06:15Z</dc:date>
    </item>
    <item>
      <title>Re: Data Compression Doesn't work in ORC with SNAPPY Compression</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172153#M57960</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/222/deepesh.html" nodeid="222"&gt;@Deepesh&lt;/A&gt;. You are right default compression is ZLIB and that causes the difference in compression. &lt;/P&gt;</description>
      <pubDate>Fri, 24 Mar 2017 00:01:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Data-Compression-Doesn-t-work-in-ORC-with-SNAPPY-Compression/m-p/172153#M57960</guid>
      <dc:creator>balavignesh_nag</dc:creator>
      <dc:date>2017-03-24T00:01:05Z</dc:date>
    </item>
  </channel>
</rss>

