<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Backing up HDFS production data in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122411#M43138</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/113/jstraub.html" nodeid="113"&gt;@Jonas Straub&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/214/agillan.html" nodeid="214"&gt;@Ana Gillan&lt;/A&gt;,&lt;A rel="user" href="https://community.cloudera.com/users/238/gbraccialli.html" nodeid="238"&gt;@Guilherme Braccialli&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 10 Oct 2016 17:50:15 GMT</pubDate>
    <dc:creator>smartninja723</dc:creator>
    <dc:date>2016-10-10T17:50:15Z</dc:date>
    <item>
      <title>Backing up HDFS production data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122410#M43137</link>
      <description>&lt;P&gt;Hi experts,&lt;/P&gt;&lt;P&gt;This question is mostly related to DR and backup.&lt;/P&gt;&lt;P&gt;We already have two clusters (identical in configuration; one is the master and the other is a hot standby). To mitigate the risk further, we are considering a 'cold backup', where we can store the HDFS data much like earlier tape-based backup solutions. We want to have this stored in our own data center (not in the cloud).&lt;/P&gt;&lt;P&gt;We do not want to invest in another cluster and use a distcp-based approach. We want to back up only the HDFS data.&lt;/P&gt;&lt;P&gt;What would be the best solution, approach, or design for this?&lt;/P&gt;&lt;P&gt;Let me know if more input is required.&lt;/P&gt;&lt;P&gt;Many thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:43:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122410#M43137</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2022-09-16T10:43:46Z</dc:date>
    </item>
    <item>
      <title>Re: Backing up HDFS production data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122411#M43138</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/113/jstraub.html" nodeid="113"&gt;@Jonas Straub&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/214/agillan.html" nodeid="214"&gt;@Ana Gillan&lt;/A&gt;,&lt;A rel="user" href="https://community.cloudera.com/users/238/gbraccialli.html" nodeid="238"&gt;@Guilherme Braccialli&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2016 17:50:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122411#M43138</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2016-10-10T17:50:15Z</dc:date>
    </item>
    <item>
      <title>Re: Backing up HDFS production data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122412#M43139</link>
      <description>&lt;P&gt;@ Smart Solution&lt;/P&gt;&lt;P&gt;Please refer to the link below; it may help you:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/articles/43525/disaster-recovery-and-backup-best-practices-in-a-t.html" target="_blank"&gt;https://community.hortonworks.com/articles/43525/disaster-recovery-and-backup-best-practices-in-a-t.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2016 02:53:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122412#M43139</guid>
      <dc:creator>ashneesharma88</dc:creator>
      <dc:date>2016-10-11T02:53:28Z</dc:date>
    </item>
    <item>
      <title>Re: Backing up HDFS production data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122413#M43140</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/3021/smartninja723.html"&gt;Smart Solutions&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The two main options for replicating the HDFS structure are Falcon and distcp. The distcp command is not very feature rich: you give it a source path in the HDFS structure and a destination on the target cluster, and it copies everything under the source to that destination. If the copy fails, you will need to run it again.&lt;/P&gt;&lt;P&gt;The other method for maintaining a replica of your HDFS structure is Falcon. It offers more data movement options, and you can more effectively manage the lifecycle of all of the data on both sides.&lt;/P&gt;&lt;P&gt;If you're moving Hive table structures, there is some additional complexity in making sure the tables are created on the DR side, but moving the actual files is done the same way.&lt;/P&gt;&lt;P&gt;You excluded distcp as an option. As such, I suggest looking at Falcon.&lt;/P&gt;&lt;P&gt;Check this: &lt;A href="http://hortonworks.com/hadoop-tutorial/mirroring-datasets-between-hadoop-clusters-with-apache-falcon/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/mirroring-datasets-between-hadoop-clusters-with-apache-falcon/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;+++++++&lt;/P&gt;&lt;P&gt;If any response addressed your question, please vote and accept the best answer.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2016 08:23:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Backing-up-HDFS-production-data/m-p/122413#M43140</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-10-11T08:23:07Z</dc:date>
    </item>
  </channel>
</rss>

