<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: What's the best way to do Monte Carlo simulation on Hadoop in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97313#M10692</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/438/pcoates.html" nodeid="438"&gt;@Peter Coates&lt;/A&gt; can you accept the best answer to close this thread?&lt;/P&gt;</description>
    <pubDate>Tue, 02 Feb 2016 09:48:07 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2016-02-02T09:48:07Z</dc:date>
    <item>
      <title>What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97308#M10687</link>
      <description>&lt;P&gt;Monte Carlo is one of many simulation types that execute a huge number of repetitive tasks using relatively little data. The "data" is usually little more than sets of parameters to a function that must be executed a zillion times. Often this is followed by some kind of summarizing process. Clearly a custom MR job can be written for this, but is there a standard framework that HDP recommends, or a published set of best practices?&lt;/P&gt;</description>
      <pubDate>Wed, 18 Nov 2015 22:32:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97308#M10687</guid>
      <dc:creator>pcoates</dc:creator>
      <dc:date>2015-11-18T22:32:08Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97309#M10688</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/438/pcoates.html" nodeid="438"&gt;@Peter Coates&lt;/A&gt; This was brought up by a couple of DS guys. We discussed using Spark: &lt;A target="_blank" href="https://cyberfrontierlabs.com/2014/08/18/setting-up-spark-and-hortonworks-sandbox/"&gt;link&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Nov 2015 22:33:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97309#M10688</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2015-11-18T22:33:44Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97310#M10689</link>
      <description>&lt;P&gt;To add to this, as a rule of thumb, Spark is the best choice when it comes to executing iterative algorithms. It helps that MLlib is built in. I haven't seen anyone writing MR by hand anymore (except for a customer of one of our competitors I met recently, who had been misled into believing 'Hive is slow').&lt;/P&gt;</description>
      <pubDate>Thu, 19 Nov 2015 01:40:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97310#M10689</guid>
      <dc:creator>bsaini</dc:creator>
      <dc:date>2015-11-19T01:40:38Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97311#M10690</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/438/pcoates.html" nodeid="438"&gt;@Peter Coates&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Why do you need Spark if the data is very small and can fit on a single node? There are other excellent Monte Carlo simulation packages that can do this efficiently, open source or otherwise. Even Excel has an add-in for this.&lt;/P&gt;&lt;P&gt;Edit: If you need more horsepower for Monte Carlo simulations than one node can provide, you can look at MPI. MPICH is pretty good: &lt;A href="https://www.mpich.org/" target="_blank"&gt;https://www.mpich.org/&lt;/A&gt; There's even a YARN adapter for MPICH: &lt;A href="https://github.com/alibaba/mpich2-yarn" target="_blank"&gt;https://github.com/alibaba/mpich2-yarn&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 05 Dec 2015 02:16:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97311#M10690</guid>
      <dc:creator>dkumar1</dc:creator>
      <dc:date>2015-12-05T02:16:47Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97312#M10691</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/191/bsaini.html" nodeid="191"&gt;@bsaini&lt;/A&gt;&lt;P&gt;Spark is best for iterative computations over large data sets, not for CPU-bound processes that use a small data set repeatedly.&lt;/P&gt;</description>
      <pubDate>Sat, 05 Dec 2015 02:19:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97312#M10691</guid>
      <dc:creator>dkumar1</dc:creator>
      <dc:date>2015-12-05T02:19:00Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97313#M10692</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/438/pcoates.html" nodeid="438"&gt;@Peter Coates&lt;/A&gt; can you accept the best answer to close this thread?&lt;/P&gt;</description>
      <pubDate>Tue, 02 Feb 2016 09:48:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97313#M10692</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-02T09:48:07Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to do Monte Carlo simulation on Hadoop</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97314#M10693</link>
      <description>&lt;P&gt;Here is an example: &lt;A href="https://community.hortonworks.com/articles/36321/predicting-stock-portfolio-losses-using-monte-carl.html" target="_blank"&gt;https://community.hortonworks.com/articles/36321/predicting-stock-portfolio-losses-using-monte-carl.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 03 Jun 2016 23:11:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/What-s-the-best-way-to-do-Monte-Carlo-simulation-on-Hadoop/m-p/97314#M10693</guid>
      <dc:creator>vzlatkin</dc:creator>
      <dc:date>2016-06-03T23:11:19Z</dc:date>
    </item>
  </channel>
</rss>

