I am using Greenplum Database along with HDP Hadoop 2.3.6. Greenplum gives gphdfs protocol to connect and access data from HDFS. To use gphdfs, it is required to install Hadoop binaries on GPDB cluster. Is there any document that provides information on that. I am this document but they have not specified the binaries that are required to install HDP.
Hi @Govind Tagai,
You can setup remote repositories using - https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.6/bk_installing_manually_book/content/config-...
Once they are setup, you can use yum to install the clients that you need. Example - yum install hadoop hadoop-hdfs hadoop-libhdfs hadoop-yarn hadoop-mapreduce hadoop-client openssl
The above steps should be performed on all the greenplum hosts.