I have a CDH4.5 cluster, and I want to upload files into it from another server (e.g. database server).
With vanilla Hadoop and Hive, I can change the configuration files, pointing the namenode and metastore to remote services, and simply run:
dba@db-001$ hadoop fs -copyFromLocal /path/to/export.tsv
dba@db-001$ hive -e "load data local inpath '/path/to/export.tsv' into table test.my_table"
But how about CDH? What components should I install on other servers?