<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Problem with starting CDH cluster on AWS using Cloudera Manager in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44719#M42199</link>
    <description>&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Hi,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Thanks for your reply.&amp;nbsp;I can definitely access the data after start and stop of my instances. In my case, my m3.xlarge instances are attached with an EBS storage device : both my boot and block devices are attached to the same ebs volume. That's also what makes it possible to stop and start the instances.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Also, as you can read&amp;nbsp;in my initial post, I'm using Cloudera Director and Cloudera Manager for the deployment/management of my CDH cluster.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;At this stage, I still do not see what's causing the issues I have mentionned above.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Regards.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Sun, 04 Sep 2016 17:15:22 GMT</pubDate>
    <dc:creator>pgb</dc:creator>
    <dc:date>2016-09-04T17:15:22Z</dc:date>
    <item>
      <title>Problem with starting CDH cluster on AWS using Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44610#M42197</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm running a POC on AWS using CDH 5.7.2. I have created and configure a simple&amp;nbsp;environment using Cloudera Director as follow :&lt;/P&gt;
&lt;P&gt;Cloudera manager&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;1 x Master&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;3 x Workers&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;1 x Gateway&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;All the 6 instances are&amp;nbsp;m3.xlarge instance type. The installation is smooth and straight foward using cloudera director. After running my jobs for the POC, I stop the cluster from cloudera manager and then stop the instances on EC2 dashboard.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;When I restart the instances and the cluster, I always get the following error in various order :&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN&gt;&lt;SPAN class="bold"&gt;Bad&lt;/SPAN&gt; :&lt;/SPAN&gt; &lt;SPAN class="bold"&gt;659 missing blocks in the cluster. 986 total blocks in the cluster. Percentage missing blocks: 66.84%. Critical threshold: any.&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN class="bold"&gt;&lt;SPAN&gt;&lt;SPAN class="bold"&gt;Bad&lt;/SPAN&gt; :&lt;/SPAN&gt; &lt;SPAN class="bold"&gt;659 under replicated blocks in the cluster. 986 total blocks in the cluster. Percentage under replicated blocks: 66.84%. Critical threshold: 40.00%.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;SPAN class="bold"&gt;&lt;SPAN class="bold"&gt;Event Server Down&amp;nbsp;(I have to manually start)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;PRE&gt;Exception while getting fetch configDefaults hash: none
java.net.ConnectException: Connection refused&lt;/PRE&gt;
&lt;PRE&gt;Failed to publish event: SimpleEvent{attributes={STACKTRACE=[java.net.ConnectExcepion: Connection refused&lt;/PRE&gt;
&lt;PRE&gt;ERROR   com.cloudera.cmf.eventcatcher.server.EventCatcherService   Could not fetch descriptor after 5 tries, exiting.&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Host monitor Down (I have to manually start)&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="bold"&gt;&lt;SPAN class="bold"&gt;&lt;SPAN&gt;I consistantly reproduce these errors for every fresh installations I have done:&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="bold"&gt;&lt;SPAN class="bold"&gt;&lt;SPAN&gt;- At first, all green light&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;SPAN&gt;&lt;SPAN class="bold"&gt;&lt;SPAN class="bold"&gt;&lt;SPAN&gt;- After stopping the cluster/instances and restarting these errors occur&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is there anything wrong with the approach I use to stop &amp;amp; start my cluster ? I've started googling a bit around the missing block issue and understant that it may be related to corrupted files. How to prevent this issue from happening ? Any best practices are welcomed...&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I've realized that I'm spending more than half of my time actually fixing the environment instead of focusing on my POC.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:37:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44610#M42197</guid>
      <dc:creator>pgb</dc:creator>
      <dc:date>2022-09-16T10:37:35Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with starting CDH cluster on AWS using Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44619#M42198</link>
      <description>As you can note on &lt;A href="https://aws.amazon.com/ec2/instance-types/" target="_blank"&gt;https://aws.amazon.com/ec2/instance-types/&lt;/A&gt; and &lt;A href="http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html#instance-store-lifetime" target="_blank"&gt;http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html#instance-store-lifetime&lt;/A&gt;, the m3.xlarge uses 2x "instance store" type disks, which will be entirely destroyed when you stop an instance. When you bring back your instance, it would not have any of its past persisted data, and that's not acceptable to a lot of CM and CDH components. Your blocks on HDFS would no longer be on the disk so they'd be reported as missing too.&lt;BR /&gt;&lt;BR /&gt;You should instead use instances that provide "EBS" storage so the data persists.&lt;BR /&gt;&lt;BR /&gt;For cloud environment deployments we recommend using Cloudera Director to install, deploy and run your Cloudera CM and CDH cluster instead of manually managing it, to avoid the little problems such as these: &lt;A href="https://www.cloudera.com/documentation/director/latest/topics/director_intro.html" target="_blank"&gt;https://www.cloudera.com/documentation/director/latest/topics/director_intro.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;You can also checkout what instance types are recommended by Cloudera Director for CM and CDH here: &lt;A href="https://www.cloudera.com/documentation/director/latest/topics/director_deployment_requirements.html#concept_fhh_ygd_nt_a" target="_blank"&gt;https://www.cloudera.com/documentation/director/latest/topics/director_deployment_requirements.html#concept_fhh_ygd_nt_a&lt;/A&gt;</description>
      <pubDate>Thu, 01 Sep 2016 08:22:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44619#M42198</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2016-09-01T08:22:02Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with starting CDH cluster on AWS using Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44719#M42199</link>
      <description>&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Hi,&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Thanks for your reply.&amp;nbsp;I can definitely access the data after start and stop of my instances. In my case, my m3.xlarge instances are attached with an EBS storage device : both my boot and block devices are attached to the same ebs volume. That's also what makes it possible to stop and start the instances.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Also, as you can read&amp;nbsp;in my initial post, I'm using Cloudera Director and Cloudera Manager for the deployment/management of my CDH cluster.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;At this stage, I still do not see what's causing the issues I have mentionned above.&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Regards.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 04 Sep 2016 17:15:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44719#M42199</guid>
      <dc:creator>pgb</dc:creator>
      <dc:date>2016-09-04T17:15:22Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with starting CDH cluster on AWS using Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44721#M42200</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Are you sure that the blocks are still existing in the DataNode hosts even after rebooting the instances? By default, the location should be under /dfs/dn{1,.2..}.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Sep 2016 01:23:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44721#M42200</guid>
      <dc:creator>dice</dc:creator>
      <dc:date>2016-09-05T01:23:05Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with starting CDH cluster on AWS using Cloudera Manager</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44937#M42201</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I did install from scratch a new cluster using m4 instance type and I could not reproduce the error.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Sep 2016 16:34:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Problem-with-starting-CDH-cluster-on-AWS-using-Cloudera/m-p/44937#M42201</guid>
      <dc:creator>pgb</dc:creator>
      <dc:date>2016-09-09T16:34:44Z</dc:date>
    </item>
  </channel>
</rss>

