<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Is HDFS Snapshot applicable to a very large file such TB? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243489#M205287</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15452/hlcastro.html" nodeid="15452"&gt;@Hamilton Castro&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Are these snapshots from Hbase  stored in HDFS ?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Krishna&lt;/P&gt;</description>
    <pubDate>Sat, 29 Jun 2019 03:36:17 GMT</pubDate>
    <dc:creator>ksankavaram</dc:creator>
    <dc:date>2019-06-29T03:36:17Z</dc:date>
    <item>
      <title>Is HDFS Snapshot applicable to a very large file such TB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243488#M205286</link>
      <description />
      <pubDate>Sat, 29 Jun 2019 03:03:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243488#M205286</guid>
      <dc:creator>hlcastro</dc:creator>
      <dc:date>2019-06-29T03:03:15Z</dc:date>
    </item>
    <item>
      <title>Re: Is HDFS Snapshot applicable to a very large file such TB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243489#M205287</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/15452/hlcastro.html" nodeid="15452"&gt;@Hamilton Castro&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Are these snapshots from Hbase  stored in HDFS ?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Krishna&lt;/P&gt;</description>
      <pubDate>Sat, 29 Jun 2019 03:36:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243489#M205287</guid>
      <dc:creator>ksankavaram</dc:creator>
      <dc:date>2019-06-29T03:36:17Z</dc:date>
    </item>
    <item>
      <title>Re: Is HDFS Snapshot applicable to a very large file such TB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243490#M205288</link>
      <description>&lt;P&gt;&lt;A rel="noopener noreferrer noopener noreferrer" href="http://@Hamilton%20Castro" target="_blank"&gt;&lt;EM&gt;@Hamilton Castro&lt;/EM&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;The simple and clear answer is "&lt;STRONG&gt;YES&lt;/STRONG&gt;" !!&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;HDFS Snapshots are read-only point-in-time copies of the file system. They can be taken on any level of the file system. Snapshot is valuable as a backup or for Business continuity plans as a Disaster recovery option.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;The concept of a snapshot can be considered&lt;STRONG&gt; Point-in-Time&lt;/STRONG&gt; [PIT] backup, which is wrong if you had a 5TB the snapshot will not be the same size, an HDFS snapshot is not a full copy of the data, rather a copy of the metadata at that point in time. Blocks in data nodes are not copied: the snapshot files record the block list and the file size. There is no data copying  (more accurately a new record in the inode). It's only on modifications (appends and truncates for HDFS) that record any data.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;The snapshot data is computed by subtracting the modifications from the current data. The modifications are recorded in chronological order, so that the current data can be accessed directly. To take snapshots, the HDFS directory has to be set as a snapshot table. If there are snapshots in a &lt;STRONG&gt;snapshottable &lt;/STRONG&gt;directory, the directory cannot be deleted nor renamed.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;So when you first take a snapshot, your HDFS storage usage will stay the same. It is only when you modify the data that data is copied/written. Copying data between clusters or storage systems, copying a snapshotted file is no different than copying a regular file - they both will copy the same way, with bytes and with metadata. There's no "copy only metadata" operation.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 30 Jun 2019 01:48:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243490#M205288</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2019-06-30T01:48:02Z</dc:date>
    </item>
    <item>
      <title>Re: Is HDFS Snapshot applicable to a very large file such TB?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243491#M205289</link>
      <description>&lt;P&gt;The above question and the entire reoply thread below was originally posted in the &lt;A href="https://community.hortonworks.com/spaces/101/index.html"&gt;Community Help&lt;/A&gt; track. On Sun Jun 30 17:30 UTC 2019, a member of the HCC moderation staff moved it to the &lt;A href="https://community.hortonworks.com/spaces/65/core-track_2.html"&gt;Hadoop Core&lt;/A&gt; track. The &lt;EM&gt;Community Help Track&lt;/EM&gt; is intended for questions about using the HCC site itself, not technical questions about HDFS.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jul 2019 00:31:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-HDFS-Snapshot-applicable-to-a-very-large-file-such-TB/m-p/243491#M205289</guid>
      <dc:creator>ask_bill_brooks</dc:creator>
      <dc:date>2019-07-01T00:31:59Z</dc:date>
    </item>
  </channel>
</rss>

