<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question HDFS BlockPlacementPolicy, is there an alternative that considers available disk space? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-BlockPlacementPolicy-is-there-an-alternative-that/m-p/134217#M43720</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am wondering if there is an BlockPlacementPolicy that in addition to storing replicas safely on different racks as the default one does, also can consider how much disk space that is available on different nodes?&lt;/P&gt;&lt;P&gt;In case where you have a cluster that consists of two sets of machines with a big difference in the amount of available disk space, the default policy will lead to the disks of the set with a smaller amount of disk space running out of disk space long before you actually reach your total HDFS capacity.&lt;/P&gt;&lt;P&gt;Is there any such policy ready to be used?&lt;/P&gt;&lt;P&gt;Best Regards&lt;/P&gt;&lt;P&gt;Thomas&lt;/P&gt;</description>
    <pubDate>Mon, 17 Oct 2016 18:25:35 GMT</pubDate>
    <dc:creator>ThomasLarsson</dc:creator>
    <dc:date>2016-10-17T18:25:35Z</dc:date>
    <item>
      <title>HDFS BlockPlacementPolicy, is there an alternative that considers available disk space?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-BlockPlacementPolicy-is-there-an-alternative-that/m-p/134217#M43720</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am wondering if there is an BlockPlacementPolicy that in addition to storing replicas safely on different racks as the default one does, also can consider how much disk space that is available on different nodes?&lt;/P&gt;&lt;P&gt;In case where you have a cluster that consists of two sets of machines with a big difference in the amount of available disk space, the default policy will lead to the disks of the set with a smaller amount of disk space running out of disk space long before you actually reach your total HDFS capacity.&lt;/P&gt;&lt;P&gt;Is there any such policy ready to be used?&lt;/P&gt;&lt;P&gt;Best Regards&lt;/P&gt;&lt;P&gt;Thomas&lt;/P&gt;</description>
      <pubDate>Mon, 17 Oct 2016 18:25:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-BlockPlacementPolicy-is-there-an-alternative-that/m-p/134217#M43720</guid>
      <dc:creator>ThomasLarsson</dc:creator>
      <dc:date>2016-10-17T18:25:35Z</dc:date>
    </item>
    <item>
      <title>Re: HDFS BlockPlacementPolicy, is there an alternative that considers available disk space?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-BlockPlacementPolicy-is-there-an-alternative-that/m-p/134218#M43721</link>
      <description>&lt;P&gt;I just found that something like this was added somewhat recently:&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/apache/hadoop/blob/f67237cbe7bc48a1b9088e990800b37529f1db2a/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/AvailableSpaceBlockPlacementPolicy.java" target="_blank"&gt;https://github.com/apache/hadoop/blob/f67237cbe7bc48a1b9088e990800b37529f1db2a/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/AvailableSpaceBlockPlacementPolicy.java&lt;/A&gt;&lt;/P&gt;&lt;P&gt;This seems to be what I was looking for.&lt;/P&gt;</description>
      <pubDate>Mon, 17 Oct 2016 18:34:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/HDFS-BlockPlacementPolicy-is-there-an-alternative-that/m-p/134218#M43721</guid>
      <dc:creator>ThomasLarsson</dc:creator>
      <dc:date>2016-10-17T18:34:54Z</dc:date>
    </item>
  </channel>
</rss>

