<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Upgrade or Fresh install in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Upgrade-or-Fresh-install/m-p/205586#M167552</link>
    <description>&lt;P&gt;&lt;EM&gt;&lt;A href="@Lenu K"&gt; @Lenu K&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Your question is rather wide for a small cluster all depends on manpower at hand, for HDF remember to back up the flow files, below are immediately what comes into my mind.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;B&gt;Fresh Install pros and con's &lt;/B&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt; Better planned &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Here you get a clean installation maybe properly configured mistakes learned from the current cluster setup. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Straightforward no upgrade surprises. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Loose Customization &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Upgrade pros and cons' &lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt;Must plan properly and document steps &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Expect technical surprises and challenge. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan support if not having one already on the D-day &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Challenges mold you to a better hadoopist!&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;See &lt;A href="https://docs.hortonworks.com/HDPDocuments/Ambari-2.6.2.0/bk_ambari-upgrade/content/post_ambari_upgrade_tasks.html" target="_blank"&gt;Mandatory Post-Upgrade Tasks&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Best practice &lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt; Verify that the file system you selected is supported HWX &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Pre-create all the databases&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Backup your cluster before either of the above. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan for at least NN/RM HA (NN are the brain so allocate good memory) MUST have 3 Zookeeper &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;HDD planning is important SSD for SCSI &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Restrict access to the cluster from the ONLY edge node. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Kerberize the Cluster &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Configure SSL think of SSD for Zk,Hbase and OS can also use the SSD acceleration for temp tables in hive, exposing the SSD via HDFS &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan well the Data center network(Backup lines) &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Size your nodes memory and storage properly. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Beware if performance is a must especially with Kafka and Storm are memory intensive. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Delegate authorization to Ranger. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Test upgrade procedures for new versions of existing components &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute performance tests of custom-built applications &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Allow end-users to perform user acceptance testing &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute integration tests where custom-built applications communicate with third-party software &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Experiment with new software that is beta quality and may not be ready for usage at all &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute security penetration tests (typically done by an external company) &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Let application developers modify configuration parameters and restart services on short notice &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Maintain a mirror image of the production environment to be activated in case of natural disaster or unforeseen events &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute regression tests that compare the outputs of new application code with existing code running in production&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;EM&gt;HTH&lt;/EM&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 04 Oct 2018 04:01:34 GMT</pubDate>
    <dc:creator>Shelton</dc:creator>
    <dc:date>2018-10-04T04:01:34Z</dc:date>
    <item>
      <title>Upgrade or Fresh install</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Upgrade-or-Fresh-install/m-p/205585#M167551</link>
      <description>&lt;P&gt;Hi All, &lt;/P&gt;&lt;P&gt;We have 10 node cluster. We have only few teams are using the cluster at the moment.What do you suggest fresh install or Upgrade . If so why ? Could you please explain the pain points of both . What is the best practice here . &lt;/P&gt;&lt;P&gt;Clean installation of OS, HDP , HDF or upgrade of HDP and HDF . If fresh install . we will take a back of all the data to another machine and reinstall everything . &lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 13:46:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Upgrade-or-Fresh-install/m-p/205585#M167551</guid>
      <dc:creator>lenu</dc:creator>
      <dc:date>2022-09-16T13:46:21Z</dc:date>
    </item>
    <item>
      <title>Re: Upgrade or Fresh install</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Upgrade-or-Fresh-install/m-p/205586#M167552</link>
      <description>&lt;P&gt;&lt;EM&gt;&lt;A href="@Lenu K"&gt; @Lenu K&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Your question is rather wide for a small cluster all depends on manpower at hand, for HDF remember to back up the flow files, below are immediately what comes into my mind.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;B&gt;Fresh Install pros and con's &lt;/B&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt; Better planned &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Here you get a clean installation maybe properly configured mistakes learned from the current cluster setup. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Straightforward no upgrade surprises. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Loose Customization &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Upgrade pros and cons' &lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt;Must plan properly and document steps &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Expect technical surprises and challenge. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan support if not having one already on the D-day &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Challenges mold you to a better hadoopist!&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;See &lt;A href="https://docs.hortonworks.com/HDPDocuments/Ambari-2.6.2.0/bk_ambari-upgrade/content/post_ambari_upgrade_tasks.html" target="_blank"&gt;Mandatory Post-Upgrade Tasks&lt;/A&gt;&lt;BR /&gt;&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Best practice &lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;EM&gt; Verify that the file system you selected is supported HWX &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Pre-create all the databases&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Backup your cluster before either of the above. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan for at least NN/RM HA (NN are the brain so allocate good memory) MUST have 3 Zookeeper &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;HDD planning is important SSD for SCSI &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Restrict access to the cluster from the ONLY edge node. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Kerberize the Cluster &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Configure SSL think of SSD for Zk,Hbase and OS can also use the SSD acceleration for temp tables in hive, exposing the SSD via HDFS &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Plan well the Data center network(Backup lines) &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Size your nodes memory and storage properly. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Beware if performance is a must especially with Kafka and Storm are memory intensive. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Delegate authorization to Ranger. &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Test upgrade procedures for new versions of existing components &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute performance tests of custom-built applications &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Allow end-users to perform user acceptance testing &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute integration tests where custom-built applications communicate with third-party software &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Experiment with new software that is beta quality and may not be ready for usage at all &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute security penetration tests (typically done by an external company) &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Let application developers modify configuration parameters and restart services on short notice &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Maintain a mirror image of the production environment to be activated in case of natural disaster or unforeseen events &lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;&lt;EM&gt;Execute regression tests that compare the outputs of new application code with existing code running in production&lt;/EM&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;EM&gt;HTH&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Oct 2018 04:01:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Upgrade-or-Fresh-install/m-p/205586#M167552</guid>
      <dc:creator>Shelton</dc:creator>
      <dc:date>2018-10-04T04:01:34Z</dc:date>
    </item>
  </channel>
</rss>

