<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Spark ML smoke test? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-ML-smoke-test/m-p/105088#M38024</link>
    <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;We have HDP 2.4.2 cluster configured with Spark. I did run smoke tests (spark PI, shell, Spark SQL) for various components. I am looking forward to a few smoke tests to prove that spark has been configured with ML libraries. Moreover, how to make sure that Spark ML configurations are optimized?&lt;/P&gt;&lt;P&gt;I was planning to run a couple of samples from &lt;A href="https://spark.apache.org/docs/1.6.1/mllib-guide.html" target="_blank"&gt;https://spark.apache.org/docs/1.6.1/mllib-guide.html&lt;/A&gt; to make sure ML libs are configured. Is that enough?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
    <pubDate>Tue, 16 Aug 2016 22:06:42 GMT</pubDate>
    <dc:creator>smartninja723</dc:creator>
    <dc:date>2016-08-16T22:06:42Z</dc:date>
    <item>
      <title>Spark ML smoke test?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-ML-smoke-test/m-p/105088#M38024</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;We have HDP 2.4.2 cluster configured with Spark. I did run smoke tests (spark PI, shell, Spark SQL) for various components. I am looking forward to a few smoke tests to prove that spark has been configured with ML libraries. Moreover, how to make sure that Spark ML configurations are optimized?&lt;/P&gt;&lt;P&gt;I was planning to run a couple of samples from &lt;A href="https://spark.apache.org/docs/1.6.1/mllib-guide.html" target="_blank"&gt;https://spark.apache.org/docs/1.6.1/mllib-guide.html&lt;/A&gt; to make sure ML libs are configured. Is that enough?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
      <pubDate>Tue, 16 Aug 2016 22:06:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-ML-smoke-test/m-p/105088#M38024</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2016-08-16T22:06:42Z</dc:date>
    </item>
    <item>
      <title>Re: Spark ML smoke test?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-ML-smoke-test/m-p/105089#M38025</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/3021/smartninja723.html" nodeid="3021"&gt;@Smart Solutions&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;I think this would be sufficient to certify that the libraries are installed and your applications will be able to find them. You can find several examples that are ready to run under /usr/hdp/current/spark-client/examples/src/main/python/mllib. You can substitute python with your preferred language to find examples that correspond to the appropriate API.&lt;/P&gt;&lt;P&gt;In terms of optimized configurations, it is hard to tune that upfront as it will be highly dependent upon on your application, dataset, and cluster.&lt;/P&gt;</description>
      <pubDate>Mon, 22 Aug 2016 08:15:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-ML-smoke-test/m-p/105089#M38025</guid>
      <dc:creator>bwilson</dc:creator>
      <dc:date>2016-08-22T08:15:05Z</dc:date>
    </item>
  </channel>
</rss>

