<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: how to format HDFS in an already installed cluster, properly ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106649#M15881</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/1198/koenigbodensee.html" nodeid="1198"&gt;@Gerd Koenig&lt;/A&gt;&lt;P&gt;Additionally, the files you're concerned with ship with our distribution; you can find them in the /usr/hdp directory on your local filesystem.&lt;/P&gt;</description>
    <pubDate>Thu, 21 Jan 2016 22:36:50 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2016-01-21T22:36:50Z</dc:date>
    <item>
      <title>how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106647#M15879</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;While installing a cluster I ran into some hardware issues, so I now have an (almost) running cluster, but with corrupt file blocks.&lt;/P&gt;&lt;P&gt;The HDFS service is up and running in HA mode, but fsck reports corrupt blocks:&lt;/P&gt;&lt;PRE&gt;FSCK started by hdfs (auth:SIMPLE) from /10.41.27.10 for path / at Thu Jan 21 13:22:00 CET 2016
..............
/hdp/apps/2.2.4.2-2/hive/hive.tar.gz: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741862


/hdp/apps/2.2.4.2-2/hive/hive.tar.gz: MISSING 1 blocks of total size 83000677 B..
/hdp/apps/2.2.4.2-2/mapreduce/hadoop-streaming.jar: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741863


/hdp/apps/2.2.4.2-2/mapreduce/hadoop-streaming.jar: MISSING 1 blocks of total size 104996 B..
/hdp/apps/2.2.4.2-2/mapreduce/mapreduce.tar.gz: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741827


/hdp/apps/2.2.4.2-2/mapreduce/mapreduce.tar.gz: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741829


/hdp/apps/2.2.4.2-2/mapreduce/mapreduce.tar.gz: MISSING 2 blocks of total size 192697367 B..
/hdp/apps/2.2.4.2-2/pig/pig.tar.gz: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741861


/hdp/apps/2.2.4.2-2/pig/pig.tar.gz: MISSING 1 blocks of total size 97548644 B..
/hdp/apps/2.2.4.2-2/tez/tez.tar.gz: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073741826


/hdp/apps/2.2.4.2-2/tez/tez.tar.gz: MISSING 1 blocks of total size 40658186 B..
/mr-history/done/2016/01/08/000000/job_1452263100546_0003-1452263260432-ambari%2Dqa-PigLatin%3ApigSmoke.sh-1452263277399-1-0-SUCCEEDED-default-1452263269870.jhist: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073742129



...

/user/ambari-qa/passwd: MISSING 1 blocks of total size 2637 B...
/user/ambari-qa/pigsmoke.out/part-v000-o000-r-00000: CORRUPT blockpool BP-1565025838-10.41.27.10-1452263064113 block blk_1073742141
/user/ambari-qa/pigsmoke.out/part-v000-o000-r-00000: MISSING 1 blocks of total size 358 B.Status: CORRUPT
 Total size:    414892275 B
 Total dirs:    7291
 Total files:   38
 Total symlinks:                0
 Total blocks (validated):      35 (avg. block size 11854065 B)
  ********************************
  CORRUPT FILES:        23
  MISSING BLOCKS:       24
  MISSING SIZE:         414887859 B
  CORRUPT BLOCKS:       24
  ********************************
 Minimally replicated blocks:   11 (31.428572 %)
 Over-replicated blocks:        0 (0.0 %)
 Under-replicated blocks:       0 (0.0 %)
 Mis-replicated blocks:         0 (0.0 %)
 Default replication factor:    2
 Average block replication:     0.62857145
 Corrupt blocks:                24
 Missing replicas:              0 (0.0 %)
 Number of data-nodes:          4
 Number of racks:               1
FSCK ended at Thu Jan 21 13:22:00 CET 2016 in 157 milliseconds
The filesystem under path '/' is CORRUPT&lt;/PRE&gt;&lt;P&gt;What I want to do now is re-format HDFS and start from a blank filesystem, since this is a new installation and no data has been uploaded to HDFS yet.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;How can I properly re-format HDFS to get rid of the corrupt blocks?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I am wary of deleting only the files it is complaining about: if I delete, for example, /hdp/apps/2.2.4.2-2/hive/hive.tar.gz, will it be re-deployed when the services restart? If not, how will those .tar.gz and .jar files be provided afterwards?&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 20:35:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106647#M15879</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2016-01-21T20:35:19Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106648#M15880</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1198/koenigbodensee.html" nodeid="1198"&gt;@Gerd Koenig&lt;/A&gt; For a blank HDFS:&lt;/P&gt;&lt;P&gt;hadoop namenode -format (Do not use this in production or in any environment that is in use; on Hadoop 2.x the hadoop namenode form is deprecated in favor of hdfs namenode -format.)&lt;/P&gt;&lt;P&gt;Now, regarding the corrupt blocks, see this: &lt;A href="http://stackoverflow.com/questions/19205057/how-to-fix-corrupt-hadoop-hdfs" target="_blank"&gt;http://stackoverflow.com/questions/19205057/how-to-fix-corrupt-hadoop-hdfs&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/format_and_start_hdfs.html"&gt;Official doc&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Now, the challenge is HA: I suggest opening a support case if you have access to support.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 20:48:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106648#M15880</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-01-21T20:48:12Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106649#M15881</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/1198/koenigbodensee.html" nodeid="1198"&gt;@Gerd Koenig&lt;/A&gt;&lt;P&gt;Additionally, the files you're concerned with ship with our distribution; you can find them in the /usr/hdp directory on your local filesystem.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 22:36:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106649#M15881</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-21T22:36:50Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106650#M15882</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1198/koenigbodensee.html" nodeid="1198"&gt;@Gerd Koenig&lt;/A&gt; If you reformat HDFS you will lose the whole /hdp folder and will have to recreate it. If you are sure everything else is now all right, it is better to remove the corrupted files and recreate them: they are all available under /usr/hdp/&amp;lt;hdp-version&amp;gt; and you can copy them to HDFS. Details can be found in the doc linked by &lt;A rel="user" href="https://community.cloudera.com/users/140/nsabharwal.html" nodeid="140"&gt;@Neeraj Sabharwal&lt;/A&gt;. For example, the Hive and Pig files are covered &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/upload_pig_hive_sqoop_tarballs_to_hdfs.html"&gt;here&lt;/A&gt;, the Tez files &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/ref-95a45227-63fc-4920-9113-7a49a97bd011.1.html"&gt;here&lt;/A&gt;, and so on. You can simply delete the files under /user/ambari-qa; they are the result of service checks, so there is no need to recreate them.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 22:45:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106650#M15882</guid>
      <dc:creator>pminovic</dc:creator>
      <dc:date>2016-01-21T22:45:45Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106651#M15883</link>
      <description>&lt;P&gt;DO NOT REFORMAT for missing blocks. If it's not a test cluster, you need to identify how you ended up with missing blocks. One possible reason is that you changed the data directories and removed some. Once you have identified the root cause and are fine with it, just take the missing files from the local filesystem and upload them into HDFS. You can also just delete the files in /user/ambari-qa that you listed.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 22:55:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106651#M15883</guid>
      <dc:creator>ravi1</dc:creator>
      <dc:date>2016-01-21T22:55:25Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106652#M15884</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1198/koenigbodensee.html" nodeid="1198"&gt;@Gerd Koenig&lt;/A&gt; Open a support ticket to handle this. I would do the same if I were in your shoes.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 22:59:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106652#M15884</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-01-21T22:59:40Z</dc:date>
    </item>
    <item>
      <title>Re: how to format HDFS in an already installed cluster, properly ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106653#M15885</link>
      <description>&lt;P&gt;Thanks @Neeraj.&lt;/P&gt;&lt;P&gt;Just to give feedback on another 'solution': in the meantime I got back two more datanodes (which had been failing during installation). After adding those hosts and restarting HDFS, the corrupt block error disappeared without any further file deletion or HDFS re-formatting.&lt;/P&gt;&lt;P&gt;Regards, Gerd
&lt;/P&gt;</description>
      <pubDate>Thu, 21 Jan 2016 23:17:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/how-to-format-HDFS-in-an-already-installed-cluster-properly/m-p/106653#M15885</guid>
      <dc:creator>geko</dc:creator>
      <dc:date>2016-01-21T23:17:03Z</dc:date>
    </item>
  </channel>
</rss>

