<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question What size of tables make the best out of ORC format? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-size-of-tables-make-the-best-out-of-ORC-format/m-p/96726#M10279</link>
    <description>&lt;P&gt;Is there any particular size of the Hive table from where ORC table shows better performance compared to other types [especially text]? User is planning to have the default stripe size.&lt;/P&gt;</description>
    <pubDate>Mon, 09 Nov 2015 11:02:32 GMT</pubDate>
    <dc:creator>vpoornalingam</dc:creator>
    <dc:date>2015-11-09T11:02:32Z</dc:date>
    <item>
      <title>What size of tables make the best out of ORC format?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-size-of-tables-make-the-best-out-of-ORC-format/m-p/96726#M10279</link>
      <description>&lt;P&gt;Is there any particular size of the Hive table from where ORC table shows better performance compared to other types [especially text]? User is planning to have the default stripe size.&lt;/P&gt;</description>
      <pubDate>Mon, 09 Nov 2015 11:02:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-size-of-tables-make-the-best-out-of-ORC-format/m-p/96726#M10279</guid>
      <dc:creator>vpoornalingam</dc:creator>
      <dc:date>2015-11-09T11:02:32Z</dc:date>
    </item>
    <item>
      <title>Re: What size of tables make the best out of ORC format?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-size-of-tables-make-the-best-out-of-ORC-format/m-p/96727#M10280</link>
      <description>&lt;P&gt;Regarding table size, it can be tunned using stripe size, compress size and indexes, see this documentation: &lt;A target="_blank" href="http://orc.apache.org/docs/hive-config.html"&gt;http://orc.apache.org/docs/hive-config.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;About performance, I believe ORC will have better performance than text files in most of the situations, but I would say Avro or SequenceFile will have a better performance for queries/use cases that needs full scans with all the columns (in tables with lots of columns). There might be an overhead for ORC to rebuild lines with lots and lots of columns.&lt;/P&gt;</description>
      <pubDate>Mon, 09 Nov 2015 21:32:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-size-of-tables-make-the-best-out-of-ORC-format/m-p/96727#M10280</guid>
      <dc:creator>gbraccialli3</dc:creator>
      <dc:date>2015-11-09T21:32:03Z</dc:date>
    </item>
  </channel>
</rss>

