Support Questions

Find answers, ask questions, and share your expertise

How to remotely connect to HDP2.4VM with RStudio using R?

avatar
Rising Star

Hi,

I have experience using Hive with Ambari. However, I would like to use Hive on the RDP2.4VM with RStudio using R. Simply put, connecting to the hortonwork vm remotely using R. If a user has done this, can they please tell me here how to do this and also where to find literature online on how to accomplish this if it is documented? I would also appreciate any tips on how to set up the dependencies if this was accomplished using rHive.

Thanks, Heath

1 ACCEPTED SOLUTION

avatar
14 REPLIES 14

avatar

@Heath Yates

Take a look at the below posting. It lists all the dependencies as well as setup instructions (not all steps will apply to you though).

http://www.rdatamining.com/big-data/r-hadoop-setup-guide

avatar
Rising Star

Sure. Will you be around tonight in case I have questions? I'll try this in an hour or two after my commute home and dinner. I actually got RStudio working on the HDP sandbox. Do you think that will make things simpler? At that point things local and hopefully simpler. I do hope to be able to access a Hadoop/hive server remotely eventually, but hope doing things locally will simplify this problem.

avatar

@Heath Yates

link for RStudio Commercial pro Version:

https://www.rstudio.com/products/rstudio/download-commercial/ Pro will work for 45 days without license.

Download Server:

https://www.rstudio.com/products/rstudio/download-server/

Documentation:

https://s3.amazonaws.com/rstudio-server/rstudio-server-pro-0.99.903-admin-guide.pdf

URL for connecting Remotely

http://<SandBox IP:8787/auth-sign-in

avatar
Rising Star

Going to try HADOOP_HOME and HIVE_HOME paths soon with the tutorial I found. I will let you know if it works. In the meantime, could you please tell me how you found that path information? I am willing to learn and appreciate the time you have taken to reply.

avatar

I installed RStudio on my production environment and handling Since 2 yrs.

avatar
Rising Star

I am getting the error '/root/RHive/usr/lib/hive/lib does not exist' when I do ant build in the ~/Rhive directory. Please see tutorial here for details. I am stuck on step 4.

avatar

Here is the link for RHive

https://github.com/nexr/RHive

Just FYI..RHive & rhdfs both are same.

avatar
Rising Star

Just kidding, the error has not resolved. I will mark yours as answer if I can get ant to build Rhive or get Hive working R. 🙂

avatar

sure..If this is what you wanted, please vote the response and accepted it as a best answer.