<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Bloom filter maintenance or updates? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159841#M122230</link>
    <description>&lt;P&gt;I have an ORC table that I am updating daily with the contents of a CSV file. When I created the table, I specified a bloom filter column. Is there any maintenance I need to perform with subsequent inserts? The table is about 500MM records and gets 50MM new records daily.&lt;/P&gt;</description>
    <pubDate>Tue, 28 Jun 2016 05:11:08 GMT</pubDate>
    <dc:creator>moapurva</dc:creator>
    <dc:date>2016-06-28T05:11:08Z</dc:date>
    <item>
      <title>Bloom filter maintenance or updates?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159841#M122230</link>
      <description>&lt;P&gt;I have an ORC table that I am updating daily with the contents of a CSV file. When I created the table, I specified a bloom filter column. Is there any maintenance I need to perform with subsequent inserts? The table is about 500MM records and gets 50MM new records daily.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2016 05:11:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159841#M122230</guid>
      <dc:creator>moapurva</dc:creator>
      <dc:date>2016-06-28T05:11:08Z</dc:date>
    </item>
    <item>
      <title>Re: Bloom filter maintenance or updates?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159842#M122231</link>
      <description>&lt;P&gt;Hive Transactions? Or a normal insert? Insert doesn't change anything since a new ORC file will be created and all ORC files will have their own bloom filter  index. I am pretty sure the same is true for ACID tables as well since the compactor effectively creates a new ORC file.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2016 05:27:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159842#M122231</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-06-28T05:27:56Z</dc:date>
    </item>
    <item>
      <title>Re: Bloom filter maintenance or updates?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159843#M122232</link>
      <description>&lt;P&gt;Just basic inserts. This is great to learn - thanks for the quick reply!&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2016 05:30:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159843#M122232</guid>
      <dc:creator>moapurva</dc:creator>
      <dc:date>2016-06-28T05:30:28Z</dc:date>
    </item>
    <item>
      <title>Re: Bloom filter maintenance or updates?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159844#M122233</link>
      <description>&lt;P&gt;yeah if you want to see it in action look into the HDFS folder before the insert and after ( you should see a couple new files like 00000_1 ... in there ) . These are the newly added rows in the new output files from your insert job. You can look at the bloom filter indexes with hive --orcfiledump -rowindex ... &amp;lt;filename&amp;gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.slideshare.net/BenjaminLeonhardi/hive-loading-data" target="_blank"&gt;http://www.slideshare.net/BenjaminLeonhardi/hive-loading-data&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 28 Jun 2016 05:33:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Bloom-filter-maintenance-or-updates/m-p/159844#M122233</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-06-28T05:33:47Z</dc:date>
    </item>
  </channel>
</rss>

