<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark+R in Cloudera 5.3.0 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/42631#M20460</link>
    <description>&lt;P&gt;(1) I would start by loading the SparkR package into RStudio so you can make use of it.&amp;nbsp;See the following link under heading "Using SparkR from RStudio"&amp;nbsp;&lt;A href="https://github.com/apache/spark/tree/master/R" target="_blank"&gt;https://github.com/apache/spark/tree/master/R&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(2) Now you are ready to run through the following tutorial. However instead of reading the data from "hdfs" load it from your local file system.&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.r-bloggers.com/a-first-look-at-spark/" target="_blank"&gt;http://www.r-bloggers.com/a-first-look-at-spark/&amp;nbsp;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(3) Study the SparkR Guide to gain more indepth knowledge.&lt;/P&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/sparkr.html" target="_blank"&gt;http://spark.apache.org/docs/latest/sparkr.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(4) Study Spark (Dataframes, RDDs, etc) for example with the Oreilly book "Learning Spark". I find that it always helps to understand how something works under the hood. The same holds for SparkR you can easily find some videos about Youtube to understand how it works under the hood, especially the distributed character of SparkR + Spark.&lt;/P&gt;</description>
    <pubDate>Wed, 06 Jul 2016 09:43:18 GMT</pubDate>
    <dc:creator>MVERVUURT</dc:creator>
    <dc:date>2016-07-06T09:43:18Z</dc:date>
    <item>
      <title>Spark+R in Cloudera 5.3.0</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/37706#M20459</link>
      <description>&lt;P&gt;Please accept my sincere salutations.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have to work with Spark using the R language in Cloudera 5.3.0. I have installed R and also the IDE RStudio in this latter.What will be&amp;nbsp;the next step?What can I do? How should I proceed?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I will be grateful if you suggest a tutorial.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best Regards !&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:05:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/37706#M20459</guid>
      <dc:creator>RMG</dc:creator>
      <dc:date>2022-09-16T10:05:08Z</dc:date>
    </item>
    <item>
      <title>Re: Spark+R in Cloudera 5.3.0</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/42631#M20460</link>
      <description>&lt;P&gt;(1) I would start by loading the SparkR package into RStudio so you can make use of it.&amp;nbsp;See the following link under heading "Using SparkR from RStudio"&amp;nbsp;&lt;A href="https://github.com/apache/spark/tree/master/R" target="_blank"&gt;https://github.com/apache/spark/tree/master/R&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(2) Now you are ready to run through the following tutorial. However instead of reading the data from "hdfs" load it from your local file system.&lt;/P&gt;&lt;P&gt;&lt;A href="http://www.r-bloggers.com/a-first-look-at-spark/" target="_blank"&gt;http://www.r-bloggers.com/a-first-look-at-spark/&amp;nbsp;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(3) Study the SparkR Guide to gain more indepth knowledge.&lt;/P&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/sparkr.html" target="_blank"&gt;http://spark.apache.org/docs/latest/sparkr.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;(4) Study Spark (Dataframes, RDDs, etc) for example with the Oreilly book "Learning Spark". I find that it always helps to understand how something works under the hood. The same holds for SparkR you can easily find some videos about Youtube to understand how it works under the hood, especially the distributed character of SparkR + Spark.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jul 2016 09:43:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/42631#M20460</guid>
      <dc:creator>MVERVUURT</dc:creator>
      <dc:date>2016-07-06T09:43:18Z</dc:date>
    </item>
    <item>
      <title>Re: Spark+R in Cloudera 5.3.0</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/85607#M20461</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SparkR has been removed from R Cran repository.Could you please advise what needs to be installed instead.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Sankar&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jan 2019 14:31:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Spark-R-in-Cloudera-5-3-0/m-p/85607#M20461</guid>
      <dc:creator>Sankar</dc:creator>
      <dc:date>2019-01-29T14:31:11Z</dc:date>
    </item>
  </channel>
</rss>

