<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Single Hive table pointing to multiple storage- S3 and HDFS in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97850#M11291</link>
    <description>&lt;P&gt;To use both S3 and HDFS for a single Hive table, you could use an external table whose partitions point to different locations.&lt;/P&gt;&lt;P&gt;The process is described in the passage that starts with "&lt;EM&gt;An interesting benefit of this flexibility is that we can archive old data on inexpensive storage&lt;/EM&gt;" at this link:&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.safaribooksonline.com/library/view/programming-hive/9781449326944/ch04.html"&gt;Programming Hive, chapter 4&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To automate this process, you could use cron, and I would guess Falcon is also an option.&lt;/P&gt;</description>
    <pubDate>Thu, 03 Dec 2015 13:50:16 GMT</pubDate>
    <dc:creator>sluangsay</dc:creator>
    <dc:date>2015-12-03T13:50:16Z</dc:date>
    <item>
      <title>Single Hive table pointing to multiple storage- S3 and HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97848#M11289</link>
      <description>&lt;P&gt;Let's assume we have 60 days of data in a Hive table. How can we automatically move data older than 30 days to S3 and keep only the latest 30 days in HDFS? How do we write a Hive query that reads the entire 60 days of data? How can a single Hive table point to multiple storage locations, S3 and HDFS?&lt;/P&gt;&lt;P&gt;Also, is it possible to configure S3 as archival storage?&lt;/P&gt;&lt;P&gt;&lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_hdfs_admin_tools/content/configuring_archival_storage.html" target="_blank"&gt;http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_hdfs_admin_tools/content/configuring_archival_storage.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2015 12:31:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97848#M11289</guid>
      <dc:creator>nbalaji-elangov</dc:creator>
      <dc:date>2015-12-03T12:31:38Z</dc:date>
    </item>
    <item>
      <title>Re: Single Hive table pointing to multiple storage- S3 and HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97849#M11290</link>
      <description>&lt;P&gt;See &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_data_governance/content/ch_falcon_retention_policy.html"&gt;this doc&lt;/A&gt; on setting Falcon retention (and cloud replication/export/archival) policies.&lt;/P&gt;&lt;P&gt;I don't think a single table can use multiple storage setups. However, you can define a &lt;A href="https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Create/Drop/AlterView"&gt;View&lt;/A&gt; as the union of one table stored on local HDFS and a second table stored on S3.&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2015 13:22:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97849#M11290</guid>
      <dc:creator>rgelhausen</dc:creator>
      <dc:date>2015-12-03T13:22:12Z</dc:date>
    </item>
    <item>
      <title>Re: Single Hive table pointing to multiple storage- S3 and HDFS</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97850#M11291</link>
      <description>&lt;P&gt;To use both S3 and HDFS for a single Hive table, you could use an external table whose partitions point to different locations.&lt;/P&gt;&lt;P&gt;The process is described in the passage that starts with "&lt;EM&gt;An interesting benefit of this flexibility is that we can archive old data on inexpensive storage&lt;/EM&gt;" at this link:&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.safaribooksonline.com/library/view/programming-hive/9781449326944/ch04.html"&gt;Programming Hive, chapter 4&lt;/A&gt;&lt;/P&gt;&lt;P&gt;To automate this process, you could use cron, and I would guess Falcon is also an option.&lt;/P&gt;</description>
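      <content:encoded><![CDATA[
<P>The partition-per-location idea in this reply could be automated along the following lines. This is only a minimal sketch, assuming an external table partitioned by integer <CODE>year</CODE>, <CODE>month</CODE> and <CODE>day</CODE> columns with one HDFS directory per day. The table name <CODE>log_messages</CODE>, the HDFS root, the bucket name, and the <CODE>s3n://</CODE> scheme are all hypothetical placeholders and do not come from the thread. The script does not run anything itself: it prints, for each partition older than the retention window, a copy to S3, an <CODE>ALTER TABLE ... PARTITION ... SET LOCATION</CODE> statement that repoints the partition, and the removal of the HDFS copy.</P>

```python
from datetime import date, timedelta

# Hypothetical names for illustration only; none of them come from the thread.
TABLE = "log_messages"
HDFS_ROOT = "/data/log_messages"
S3_ROOT = "s3n://ourbucket/logs"
RETENTION_DAYS = 30


def archive_commands(partition_day, today):
    """Return shell commands that archive one daily partition, or [] if it is still recent."""
    if (today - partition_day).days <= RETENTION_DAYS:
        return []
    # Assumed directory layout: <root>/YYYY/MM/DD for each daily partition.
    suffix = partition_day.strftime("%Y/%m/%d")
    hdfs_path = f"{HDFS_ROOT}/{suffix}"
    s3_path = f"{S3_ROOT}/{suffix}"
    spec = (f"year = {partition_day.year}, month = {partition_day.month}, "
            f"day = {partition_day.day}")
    alter = f"ALTER TABLE {TABLE} PARTITION ({spec}) SET LOCATION '{s3_path}';"
    return [
        f"hadoop distcp {hdfs_path} {s3_path}",  # 1. copy the partition data to S3
        f'hive -e "{alter}"',                    # 2. repoint the partition at the S3 copy
        f"hadoop fs -rm -r {hdfs_path}",         # 3. remove the HDFS copy
    ]


if __name__ == "__main__":
    today = date.today()
    # Walk the last 60 days; only partitions older than the window yield commands.
    for age in range(60):
        for cmd in archive_commands(today - timedelta(days=age), today):
            print(cmd)
```

<P>A daily cron entry could review or pipe the printed commands to a shell. Because the partitions stay registered on the same table, a query over all 60 days would not need to change, though reads from the S3-backed partitions may be slower. In practice each step should be checked for success before the next one runs, since deleting the HDFS copy after a failed transfer would lose data.</P>
]]></content:encoded>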
      <pubDate>Thu, 03 Dec 2015 13:50:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Single-Hive-table-pointing-to-multiple-storage-S3-and-HDFS/m-p/97850#M11291</guid>
      <dc:creator>sluangsay</dc:creator>
      <dc:date>2015-12-03T13:50:16Z</dc:date>
    </item>
  </channel>
</rss>

