Hi, one of our developers needs to run pig scripts, access HDFS, run mapreduce on the CDH5 cluster from a remote machine. I'm a little confused how to accomplish this.
Do I need to add the remote machine to the cluster using the "Add Host" feature? I also read that I should make the remote machine a "Gateway", then download (deploy?) the client config files to that host. Am I on the right track? Thank you. -Mike