I have experience using Hive with Ambari. However, I would like to use Hive on the RDP2.4VM with RStudio using R. Simply put, connecting to the hortonwork vm remotely using R. If a user has done this, can they please tell me here how to do this and also where to find literature online on how to accomplish this if it is documented? I would also appreciate any tips on how to set up the dependencies if this was accomplished using rHive.
Sure. Will you be around tonight in case I have questions? I'll try this in an hour or two after my commute home and dinner. I actually got RStudio working on the HDP sandbox. Do you think that will make things simpler? At that point things local and hopefully simpler. I do hope to be able to access a Hadoop/hive server remotely eventually, but hope doing things locally will simplify this problem.
Going to try HADOOP_HOME and HIVE_HOME paths soon with the tutorial I found. I will let you know if it works. In the meantime, could you please tell me how you found that path information? I am willing to learn and appreciate the time you have taken to reply.
I still am getting the error for step 4. You mentioned rhdfs and rhive are both the same. Which is newer and which one should I be using in R then? I am just trying to get Hive functionality in my Rscripts. Thanks. 🙂
I think we are close? The error states 'BUILD FAILED' at /root/Rhive/build.xml:39: /root/RHive/usr/hdp/current/hive-server2/lib does not exist'. Shouldn't it be looking in /usr/hdp/current and not in /root/Rhive? Not sure what build.xml is doing. 😞
I'm not sure if this will fix the problem. I will try, but curios as to why you suggest this? I think this original question I asked here has been resolved, but this ant build problem has merited a separate question and asked it here.