<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: What is serialization in ORC? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-serialization-in-ORC/m-p/224405#M75046</link>
    <description>&lt;P&gt;Serialization is the process of converting in-memory data into a sequence of bytes that can be written to disk or sent over a network. Different applications and file formats serialize data differently, depending on whether they are optimizing for reads, writes, or storage size. As the Hive language manual describes, ORC encodes and compresses integer and string columns in different ways, and the manual lists the rules it uses for each. For example, variable-width encoding reduces space usage because smaller values are encoded in fewer bytes.&lt;/P&gt;&lt;P&gt;See the following Wikipedia article for more detail:&lt;/P&gt;&lt;P&gt;&lt;A href="https://en.wikipedia.org/wiki/Serialization" target="_blank"&gt;https://en.wikipedia.org/wiki/Serialization&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 27 Feb 2018 03:48:47 GMT</pubDate>
    <dc:creator>anarasimham</dc:creator>
    <dc:date>2018-02-27T03:48:47Z</dc:date>
    <item>
      <title>What is serialization in ORC?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-serialization-in-ORC/m-p/224404#M75045</link>
      <description>&lt;P&gt;Link: &lt;A href="https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC" target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The page linked above has a section named Serialization. Can somebody explain what serialization is and what it is used for?&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2018 02:24:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-serialization-in-ORC/m-p/224404#M75045</guid>
      <dc:creator>Hadoopy</dc:creator>
      <dc:date>2018-02-27T02:24:55Z</dc:date>
    </item>
    <item>
      <title>Re: What is serialization in ORC?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-serialization-in-ORC/m-p/224405#M75046</link>
      <description>&lt;P&gt;Serialization is the process of converting in-memory data into a sequence of bytes that can be written to disk or sent over a network. Different applications and file formats serialize data differently, depending on whether they are optimizing for reads, writes, or storage size. As the Hive language manual describes, ORC encodes and compresses integer and string columns in different ways, and the manual lists the rules it uses for each. For example, variable-width encoding reduces space usage because smaller values are encoded in fewer bytes.&lt;/P&gt;&lt;P&gt;See the following Wikipedia article for more detail:&lt;/P&gt;&lt;P&gt;&lt;A href="https://en.wikipedia.org/wiki/Serialization" target="_blank"&gt;https://en.wikipedia.org/wiki/Serialization&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2018 03:48:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-is-serialization-in-ORC/m-p/224405#M75046</guid>
      <dc:creator>anarasimham</dc:creator>
      <dc:date>2018-02-27T03:48:47Z</dc:date>
    </item>
  </channel>
</rss>

