<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: HDFS Heterogeneous Storage - Using AWS S3 as storage tier in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120932#M34249</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/200/awatson.html" nodeid="200"&gt;@Andrew Watson&lt;/A&gt;, HDFS and S3 are distinct file systems. Today there is no way to use S3 as a storage tier within HDFS. You can use the S3A file system which is bundled in the Apache Hadoop distributions to store data in S3. However your application (or administrator) would have to make a conscious decision to use either HDFS or S3A.&lt;/P&gt;&lt;P&gt;You may find &lt;A href="https://issues.apache.org/jira/browse/HDFS-9806"&gt;HDFS-9806&lt;/A&gt; interesting. This is a proposal from Microsoft to use alternate filesystems like Amazon S3 or Microsoft Azure as storage types within HDFS. Sounds like it exactly addresses your use case.&lt;/P&gt;</description>
    <pubDate>Sat, 09 Jul 2016 01:51:35 GMT</pubDate>
    <dc:creator>ArpitAgarwal</dc:creator>
    <dc:date>2016-07-09T01:51:35Z</dc:date>
    <item>
      <title>HDFS Heterogeneous Storage - Using AWS S3 as storage tier</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120930#M34247</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;Is it possible to use AWS S3 as a storage tier within HDFS Heterogeneous Storage? If so, any insight would be greatly appreciated.&lt;/P&gt;</description>
      <pubDate>Fri, 08 Jul 2016 20:36:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120930#M34247</guid>
      <dc:creator>awatson</dc:creator>
      <dc:date>2016-07-08T20:36:12Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS Heterogeneous Storage - Using AWS S3 as storage tier</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120931#M34248</link>
      <description>&lt;P&gt;have you looked at alluxio as a virtual layer over hdfs and s3&lt;/P&gt;</description>
      <pubDate>Sat, 09 Jul 2016 00:07:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120931#M34248</guid>
      <dc:creator>TimothySpann</dc:creator>
      <dc:date>2016-07-09T00:07:08Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS Heterogeneous Storage - Using AWS S3 as storage tier</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120932#M34249</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/200/awatson.html" nodeid="200"&gt;@Andrew Watson&lt;/A&gt;, HDFS and S3 are distinct file systems. Today there is no way to use S3 as a storage tier within HDFS. You can use the S3A file system which is bundled in the Apache Hadoop distributions to store data in S3. However your application (or administrator) would have to make a conscious decision to use either HDFS or S3A.&lt;/P&gt;&lt;P&gt;You may find &lt;A href="https://issues.apache.org/jira/browse/HDFS-9806"&gt;HDFS-9806&lt;/A&gt; interesting. This is a proposal from Microsoft to use alternate filesystems like Amazon S3 or Microsoft Azure as storage types within HDFS. Sounds like it exactly addresses your use case.&lt;/P&gt;</description>
      <pubDate>Sat, 09 Jul 2016 01:51:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120932#M34249</guid>
      <dc:creator>ArpitAgarwal</dc:creator>
      <dc:date>2016-07-09T01:51:35Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS Heterogeneous Storage - Using AWS S3 as storage tier</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120933#M34250</link>
      <description>&lt;P&gt;Sanjay Radia recently presented a new concept to be introduced into HDFS (Hadoop 3) called Storage Containers. Storage Containers are an extensibility mechanism that will allow HDFS to manage object storage, such as S3. Watch
&lt;A href="https://www.youtube.com/watch?v=SdmJHmpvp7E" target="_blank"&gt;https://www.youtube.com/watch?v=SdmJHmpvp7E&lt;/A&gt;
and see "EVOLVING HDFS TO A GENERALIZED DISTRIBUTED STORAGE SUBSYSTEM".&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jul 2016 01:23:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-Heterogeneous-Storage-Using-AWS-S3-as-storage-tier/m-p/120933#M34250</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-07-12T01:23:38Z</dc:date>
    </item>
  </channel>
</rss>

