<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180992#M83112</link>
    <description>&lt;P&gt;@Jay , very nice solution &lt;/P&gt;&lt;P&gt;until now I was doing this , in ordeer to verify the file&lt;/P&gt;&lt;PRE&gt;gzip -t /var/tmp/spark2-hdp-yarn-archive.tar.gz&lt;BR /&gt;gunzip -c /var/tmp/spark2-hdp-yarn-archive.tar.gz | tar t &amp;gt; /dev/null&lt;BR /&gt;&lt;CODE&gt;tar tzvf spark2-hdp-yarn-archive.tar.gz  &amp;gt; /dev/null&lt;/CODE&gt;&lt;BR /&gt;&lt;/PRE&gt;</description>
    <pubDate>Wed, 05 Sep 2018 12:29:42 GMT</pubDate>
    <dc:creator>mike_bronson7</dc:creator>
    <dc:date>2018-09-05T12:29:42Z</dc:date>
    <item>
      <title>what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180988#M83108</link>
      <description>&lt;P&gt;hi all,&lt;/P&gt;&lt;P&gt;we installed new hadoop cluster ( ambari + HDP version 2.6.4 )&lt;/P&gt;&lt;P&gt;after installation , we notice that we have problem with the spark-submit&lt;/P&gt;&lt;P&gt;and finally we found that &lt;STRONG&gt;spark2-hdp-yarn-archive.tar.gz&lt;/STRONG&gt; file is corruption&lt;/P&gt;&lt;P&gt;full path - /hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz  ( from HDFS )&lt;/P&gt;&lt;P&gt;my question is -  what could be the reason that this is is corrupted ? &lt;/P&gt;&lt;P&gt;in spite this cluster is new fresh installation &lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 00:33:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180988#M83108</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2018-09-05T00:33:58Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180989#M83109</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/26229/uribarih.html" nodeid="26229"&gt;@Michael Bronson&lt;/A&gt;&lt;/P&gt;&lt;P&gt;What kind of corruption is that? file is incomplete or less size that it should be?&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 01:24:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180989#M83109</guid>
      <dc:creator>apappu</dc:creator>
      <dc:date>2018-09-05T01:24:20Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180990#M83110</link>
      <description>&lt;P&gt;I cant tell you exaclty  but after I tar again the files , this solve my problem &lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 02:33:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180990#M83110</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2018-09-05T02:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180991#M83111</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/26229/uribarih.html" nodeid="26229"&gt;@Michael Bronson&lt;/A&gt;&lt;/P&gt;&lt;P&gt;As the file path which you shared is on HDFS : &lt;STRONG&gt;/hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz &lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;To identify "corrupt" or "missing" blocks, the command-line command &lt;STRONG&gt; &lt;/STRONG&gt;can be used to knwo whether it is healthy or not?&lt;/P&gt;&lt;PRE&gt;# su - hdfs -c "hdfs fsck /hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz"
.
Connecting to namenode via &lt;A href="http://hdfcluster2.example.com:50070/fsck?ugi=hdfs&amp;amp;path=%2Fhdp%2Fapps%2F2.6.4.0-91%2Fspark2%2Fspark2-hdp-yarn-archive.tar.gz" target="_blank"&gt;http://hdfcluster2.example.com:50070/fsck?ugi=hdfs&amp;amp;path=%2Fhdp%2Fapps%2F2.6.4.0-91%2Fspark2%2Fspark2-hdp-yarn-archive.tar.gz&lt;/A&gt;
FSCK started by hdfs (auth:SIMPLE) from /172.22.197.159 for path /hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz at Wed Sep 05 01:51:25 UTC 2018
.Status: HEALTHY
 Total size:    189997800 B
 Total dirs:    0
 Total files:    1
 Total symlinks:        0
 Total blocks (validated):    2 (avg. block size 94998900 B)
 Minimally replicated blocks:    2 (100.0 %)
 Over-replicated blocks:    0 (0.0 %)
 Under-replicated blocks:    0 (0.0 %)
 Mis-replicated blocks:        0 (0.0 %)
 Default replication factor:    3
 Average block replication:    3.0
 Corrupt blocks:        0
 Missing replicas:        0 (0.0 %)
 Number of data-nodes:        4
 Number of racks:        1
FSCK ended at Wed Sep 05 01:51:25 UTC 2018 in 35 milliseconds
The filesystem under path '/hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz' is HEALTHY&lt;/PRE&gt;&lt;P&gt;HDFS will attempt to recover the situation automatically. By default there are three replicas of any block in the cluster. so if HDFS detects that one replica of a block has become corrupt or damaged, HDFS will create a new replica of that block from a known-good replica, and will mark the damaged one for deletion.&lt;BR /&gt;&lt;BR /&gt;The chances of three replicas of the same block becoming damaged is so remote that it would suggest a significant failure somewhere else in the cluster. If this situation does occur, and all three replicas are damaged, then 'hdfs fsck' will report that block as "corrupt" - i.e. HDFS cannot self-heal the block from any of its replicas.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Although there are some Articles which can be referred to fix the "Under replicated Blocks" like: &lt;BR /&gt;&lt;A href="https://community.hortonworks.com/articles/4427/fix-under-replicated-blocks-in-hdfs-manually.html" target="_blank"&gt;https://community.hortonworks.com/articles/4427/fix-under-replicated-blocks-in-hdfs-manually.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;How to fix missing/corrupted/under or over-replicated blocks?&lt;BR /&gt;&lt;A href="https://community.hortonworks.com/content/supportkb/49106/how-to-fix-missingcorruptedunder-or-over-replicate.html" target="_blank"&gt;https://community.hortonworks.com/content/supportkb/49106/how-to-fix-missingcorruptedunder-or-over-replicate.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 08:52:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180991#M83111</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2018-09-05T08:52:26Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180992#M83112</link>
      <description>&lt;P&gt;@Jay , very nice solution &lt;/P&gt;&lt;P&gt;until now I was doing this , in ordeer to verify the file&lt;/P&gt;&lt;PRE&gt;gzip -t /var/tmp/spark2-hdp-yarn-archive.tar.gz&lt;BR /&gt;gunzip -c /var/tmp/spark2-hdp-yarn-archive.tar.gz | tar t &amp;gt; /dev/null&lt;BR /&gt;&lt;CODE&gt;tar tzvf spark2-hdp-yarn-archive.tar.gz  &amp;gt; /dev/null&lt;/CODE&gt;&lt;BR /&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 05 Sep 2018 12:29:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180992#M83112</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2018-09-05T12:29:42Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180993#M83113</link>
      <description>&lt;P&gt;@Jay in spite
&lt;/P&gt;&lt;P&gt; this is diff case , I post yesterday the thred - &lt;A href="https://community.hortonworks.com/questions/217423/spark-application-communicating-with-driver-in-hea.html" target="_blank"&gt;https://community.hortonworks.com/questions/217423/spark-application-communicating-with-driver-in-hea.html&lt;/A&gt; , can you help me with this ?&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 12:33:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180993#M83113</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2018-09-05T12:33:20Z</dc:date>
    </item>
    <item>
      <title>Re: what could be the cause for spark2-hdp-yarn-archive.tar.gz  corruption</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180994#M83114</link>
      <description>&lt;P&gt;@Jay . please let me know if I understand it as the following &lt;/P&gt;&lt;P&gt;&lt;EM&gt;let say that one of the replica spark2-hdp-yarn-archive.tar.gz , is corrupted &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;when I run this CLI  &lt;STRONG&gt;su - hdfs -c "hdfs fsck /hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz"&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;dose its actually means that fsck will replace the bad one with the good one and status finally will be  HEALTHY ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 05 Sep 2018 15:06:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/what-could-be-the-cause-for-spark2-hdp-yarn-archive-tar-gz/m-p/180994#M83114</guid>
      <dc:creator>mike_bronson7</dc:creator>
      <dc:date>2018-09-05T15:06:59Z</dc:date>
    </item>
  </channel>
</rss>

