<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: RHadoop on CDH5.3, rmr2  and rhdfs packages are  not available for R 3.2.1 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31682#M7193</link>
    <description>&lt;P&gt;Thank you so much "Sowen" for your suggestions !&lt;/P&gt;</description>
    <pubDate>Thu, 10 Sep 2015 09:12:34 GMT</pubDate>
    <dc:creator>RMG</dc:creator>
    <dc:date>2015-09-10T09:12:34Z</dc:date>
    <item>
      <title>RHadoop on CDH5.3, rmr2  and rhdfs packages are  not available for R 3.2.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31483#M7191</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to try RHadoop on CDH 5.3 that's why I should install the "rmr2" and "rhdfs" packages under R but I &amp;nbsp;was surprised that these two packages are not available for the recent version of R(3.2.1) which is installed on Centos.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What should I do I need these two packages urgently!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best regards.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:39:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31483#M7191</guid>
      <dc:creator>RMG</dc:creator>
      <dc:date>2022-09-16T09:39:48Z</dc:date>
    </item>
    <item>
      <title>Re: RHadoop on CDH5.3, rmr2  and rhdfs packages are  not available for R 3.2.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31484#M7192</link>
      <description>&lt;P&gt;I haven't tried those packages in a while (not since R 3.2 at least), but I know they haven't been updated in a while:&amp;nbsp;&lt;A href="https://github.com/RevolutionAnalytics/rhdfs/releases" target="_blank"&gt;https://github.com/RevolutionAnalytics/rhdfs/releases&lt;/A&gt; It wouldn't surprise me if they're not maintained now, especially given Revolution is probably shifting gears now that they're part of MSFT. I don't know. It's really a question for Revo or those open source projects.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Not sure if it helps, but here's a way you could use local Hadoop binaries to read from HDFS and then just pipe the result into R.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Edit your ~/.Renviron to set up Hadoop env variables. For me on my Mac it's:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;HADOOP_CMD=/usr/local/Cellar/hadoop/2.7.1/bin/hadoop
HADOOP_HOME=/usr/local/Cellar/hadoop/2.7.1/
HADOOP_CONF_DIR=/Users/srowen/Documents/Cloudera/hadoop-conf/&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;where hadoop-conf is a copy of the config directory from my cluster.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Then in R something like:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;data&amp;nbsp;&amp;lt;- read.csv(pipe("hdfs dfs -cat /user/sowen/data/part-*"), header=FALSE))&lt;/PRE&gt;&lt;P&gt;&lt;SPAN&gt;You get the idea.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For rmr2: I'd suggest you don't really want to run MapReduce. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; It's pretty easy to trigger an R script&amp;nbsp;from, say, a Spark task and parallelize a bunch of R scripts across the cluster with its "pipe" command. That's roughly what rmr2 helps you do. You still have to set up R across the cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There's also SparkR on the way, but still pretty green.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Sep 2015 22:27:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31484#M7192</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-09-02T22:27:47Z</dc:date>
    </item>
    <item>
      <title>Re: RHadoop on CDH5.3, rmr2  and rhdfs packages are  not available for R 3.2.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31682#M7193</link>
      <description>&lt;P&gt;Thank you so much "Sowen" for your suggestions !&lt;/P&gt;</description>
      <pubDate>Thu, 10 Sep 2015 09:12:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/31682#M7193</guid>
      <dc:creator>RMG</dc:creator>
      <dc:date>2015-09-10T09:12:34Z</dc:date>
    </item>
    <item>
      <title>Re: RHadoop on CDH5.3, rmr2  and rhdfs packages are  not available for R 3.2.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/33163#M7194</link>
      <description>Sowen, I am new to R and its configuration, i am working on a project where i need to import some data from hdfs(Cloudera CDH clister) on to my Windows running R/Rstudio environment.&lt;BR /&gt;I had r/rstudio installed on windows and while trying to install rhdfs package which requires setting up of HADOOP_CMD environment variable pointing to hadoop binaries.&lt;BR /&gt;&lt;BR /&gt;My Hadoop cluster is running on Linux and any suggestions how i can set this HADOOP_CMD variable pointing to hadoop binaries on my windows running R environment? thank you!</description>
      <pubDate>Tue, 20 Oct 2015 14:58:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/RHadoop-on-CDH5-3-rmr2-and-rhdfs-packages-are-not-available/m-p/33163#M7194</guid>
      <dc:creator>Poodah</dc:creator>
      <dc:date>2015-10-20T14:58:50Z</dc:date>
    </item>
  </channel>
</rss>

