<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Advice on Hardware Specification in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18262#M2806</link>
    <description>&lt;P&gt;Thanks for your advice.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a confusion starting from my first glance on Hadoop. When we are mentioning about server nowsaday, we actually refer to a VM on a VMWare ESXi or similar vm platform. However, for Hadoop, we are making use of the distributive nature.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am still confused for a long time. For a production deployment, can I allocate VMs on the same server to use Cloudera? I am concerning the normal practice.&lt;/P&gt;</description>
    <pubDate>Fri, 05 Sep 2014 08:26:50 GMT</pubDate>
    <dc:creator>Charles Siu</dc:creator>
    <dc:date>2014-09-05T08:26:50Z</dc:date>
    <item>
      <title>Advice on Hardware Specification</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18176#M2804</link>
      <description>&lt;P&gt;I am planning to build a cloudera cluster. The application is about text processing on web pages. As a beginner of Hadoop, we would like to have a small scale to start with.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could any one share the requirement for a small cloudera cluster? For example, the CPU, RAM, hard disk drives, and number of nodes in the cluster?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:06:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18176#M2804</guid>
      <dc:creator>Charles Siu</dc:creator>
      <dc:date>2022-09-16T09:06:44Z</dc:date>
    </item>
    <item>
      <title>Re: Advice on Hardware Specification</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18182#M2805</link>
      <description>Hello Charles, as a beginner it would be easier if you experimented with&lt;BR /&gt;Hadoop on AWS instances before buying hardware. You can begin by building a&lt;BR /&gt;simple 3-4 node cluster. The hardware requirements depend on your planned&lt;BR /&gt;work but you can begin with nodes with 8GB RAM and storage based in your&lt;BR /&gt;data set. Get familiar with the software and then look to scale upward&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 04 Sep 2014 11:15:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18182#M2805</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-09-04T11:15:35Z</dc:date>
    </item>
    <item>
      <title>Re: Advice on Hardware Specification</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18262#M2806</link>
      <description>&lt;P&gt;Thanks for your advice.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a confusion starting from my first glance on Hadoop. When we are mentioning about server nowsaday, we actually refer to a VM on a VMWare ESXi or similar vm platform. However, for Hadoop, we are making use of the distributive nature.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am still confused for a long time. For a production deployment, can I allocate VMs on the same server to use Cloudera? I am concerning the normal practice.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Sep 2014 08:26:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18262#M2806</guid>
      <dc:creator>Charles Siu</dc:creator>
      <dc:date>2014-09-05T08:26:50Z</dc:date>
    </item>
    <item>
      <title>Re: Advice on Hardware Specification</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18392#M2807</link>
      <description>If you are only looking to learn, you are fine with using multiple VMs on&lt;BR /&gt;the same host. But performance will be poor if the VMs are starved for CPU&lt;BR /&gt;or if they share disks.&lt;BR /&gt;&lt;BR /&gt;Looks like you are just beginning to use Hadoop, so I would suggest first&lt;BR /&gt;getting up to speed with installation, and configuration rather than&lt;BR /&gt;performance. Get yourself a copy of these two books:&lt;BR /&gt;&lt;BR /&gt;- Hadoop Operations / Eric Sammer&lt;BR /&gt;&lt;A target="_blank" href="http://shop.oreilly.com/product/0636920025085.do"&gt;http://shop.oreilly.com/product/0636920025085.do&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;- Hadoop: The Definitive Guide&lt;BR /&gt;&lt;A target="_blank" href="http://shop.oreilly.com/product/9780596521981.do"&gt;http://shop.oreilly.com/product/9780596521981.do&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 08 Sep 2014 12:16:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18392#M2807</guid>
      <dc:creator>GautamG</dc:creator>
      <dc:date>2014-09-08T12:16:17Z</dc:date>
    </item>
    <item>
      <title>Re: Advice on Hardware Specification</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18588#M2808</link>
      <description>&lt;P&gt;Thanks for your advice.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Currently, I'm getting myself familiar by building the cluster with several VMs on a Linux host. I will get the copy of the books you mentioned!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now, hope I my understanding is correct, in production environment, each node corresponds to a physical server in a rack. If I want to setup a 4-node cluster, I will probably have 4 1U servers on my rack.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It seems I'd better go for AWS or Google Cloud first. Is there any good option? I just wonder when we use AWS, we are actually using VMs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Sep 2014 06:38:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Advice-on-Hardware-Specification/m-p/18588#M2808</guid>
      <dc:creator>Charles Siu</dc:creator>
      <dc:date>2014-09-11T06:38:13Z</dc:date>
    </item>
  </channel>
</rss>

