<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: when do you not use orc tables? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/when-do-you-not-use-orc-tables/m-p/201443#M74336</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/14451/pjalleda.html" nodeid="14451"&gt;@PJ&lt;/A&gt;, the honest truth is there is no good reason not to use ORC format. You can use another format like Parquet but it won't provide ACID, LLAP cache, or the same level of performance. I would say the decision is similar to not using indexes in a relational system or not running statistics. ORC is simply best practice for high performance data warehousing in Hive. &lt;/P&gt;&lt;P&gt;Keep in mind that LLAP will allow you to cache raw text files. This may be an option if you have some strict SLA preventing you from incurring the conversion delay of the text file to ORC. &lt;/P&gt;</description>
    <pubDate>Wed, 07 Feb 2018 02:26:30 GMT</pubDate>
    <dc:creator>SQLShaw</dc:creator>
    <dc:date>2018-02-07T02:26:30Z</dc:date>
    <item>
      <title>when do you not use orc tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/when-do-you-not-use-orc-tables/m-p/201442#M74335</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I have some large tables in our hadoop cluster which are in text format, i would like to change all to orc ... is there something i need to worry about if all tables are orc? in what circumstances you dont use orc? &lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 12:49:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/when-do-you-not-use-orc-tables/m-p/201442#M74335</guid>
      <dc:creator>pmj</dc:creator>
      <dc:date>2022-09-16T12:49:58Z</dc:date>
    </item>
    <item>
      <title>Re: when do you not use orc tables?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/when-do-you-not-use-orc-tables/m-p/201443#M74336</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/14451/pjalleda.html" nodeid="14451"&gt;@PJ&lt;/A&gt;, the honest truth is there is no good reason not to use ORC format. You can use another format like Parquet but it won't provide ACID, LLAP cache, or the same level of performance. I would say the decision is similar to not using indexes in a relational system or not running statistics. ORC is simply best practice for high performance data warehousing in Hive. &lt;/P&gt;&lt;P&gt;Keep in mind that LLAP will allow you to cache raw text files. This may be an option if you have some strict SLA preventing you from incurring the conversion delay of the text file to ORC. &lt;/P&gt;</description>
      <pubDate>Wed, 07 Feb 2018 02:26:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/when-do-you-not-use-orc-tables/m-p/201443#M74336</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2018-02-07T02:26:30Z</dc:date>
    </item>
  </channel>
</rss>

