Contributor
Posts: 30
Registered: ‎10-25-2013
Accepted Solution

Access HDFS/MR from remote machine

Hi, one of our developers needs to run Pig scripts, access HDFS, and run MapReduce jobs on our CDH5 cluster from a remote machine. I'm a little unsure how to set this up.

Do I need to add the remote machine to the cluster using the "Add Host" feature? I also read that I should make the remote machine a "Gateway" and then deploy the client configuration files to that host. Am I on the right track? Thank you. -Mike

Expert Contributor
Posts: 101
Registered: ‎01-24-2014

Re: Access HDFS/MR from remote machine


Hi Mike,

 

Yes, I believe you are on the right track. You would need to add the remote host to the cluster, and then make it part of the gateway role group for HDFS and the gateway role group for MapReduce for that cluster.

You could create a new role group if the remote host needs a different configuration than the other gateway nodes in your cluster.
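
Once the client configuration has been deployed to the gateway host, one quick sanity check is to confirm that the deployed core-site.xml points at your cluster's NameNode. Below is a minimal sketch in Python, using only the standard library. The helper name read_hadoop_property is made up for illustration, and the path /etc/hadoop/conf/core-site.xml is an assumption about where the client configuration lands on your host, so adjust it if your setup differs.

```python
import xml.etree.ElementTree as ET


def read_hadoop_property(conf_file, name):
    """Return the value of a property from a Hadoop *-site.xml file, or None.

    Hypothetical helper for illustration; it only reads the XML file and
    does not contact the cluster.
    """
    root = ET.parse(conf_file).getroot()
    # Hadoop config files hold <property><name>..</name><value>..</value></property>
    # entries under a single <configuration> root element.
    for prop in root.findall("property"):
        if prop.findtext("name") == name:
            return prop.findtext("value")
    return None


if __name__ == "__main__":
    # Assumed location of the deployed client configuration on the gateway host.
    conf_path = "/etc/hadoop/conf/core-site.xml"
    print(read_hadoop_property(conf_path, "fs.defaultFS"))
```

If the printed value is the hdfs:// URI of your cluster, the developer should be able to run something like "hdfs dfs -ls /" from that host as a follow-up test.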

 

 
