Member since
06-13-2019
12-14-2021
05:33 AM
Hi,

We used to do the following to access Phoenix from pyspark in our HDP environment:

pyspark --jars /usr/hdp/2.6.4.0-91/phoenix/phoenix-spark2.jar,/usr/hdp/2.6.4.0-91/phoenix/phoenix-client.jar,/usr/hdp/2.6.4.0-91/phoenix/phoenix-server.jar

Now, with CDP 7.1.5, which jars need to be included when running pyspark? The documentation mentions only a Maven manifest file and says nothing about pyspark.

Looking forward to your help.

Best Regards,
Ajaz-Ur-Rahman
11-19-2021
05:44 AM
We are trying to copy data from our HDP 2.6 cluster to our CDP 7.1 cluster. Neither cluster is secured yet. We are using the following command to copy the data from the source to the target cluster:

hadoop distcp hdfs://SourceNameNodeIP:8020/apps/hive/warehouse/network_stg.db/stg_aint_voice_init hdfs://TargetNameNodeIP:8020//apps/hive/warehouse/tablespace/external/stg_aint_voice_init

Not all ports are open on the target, so we have asked the customer to allow traffic to ports 8020, 1004, 50010, 50070, 1006, and 14000. The distcp job starts and is able to create a zero-byte file on the target, but when the map job starts it fails with the error. The customer can't allow traffic on all ports.
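
For anyone reproducing this, a quick way to see which of the requested ports are actually reachable is a plain TCP connect test run from a source-cluster worker node, since distcp map tasks write blocks directly to the target DataNodes and not only to the NameNode. The sketch below is only an illustration: the port list is the one from the post, the host argument is whatever target address you want to test, and `check_ports` is a hypothetical helper name. Note that Apache Hadoop 3 changed several default ports relative to Hadoop 2 (for example, the DataNode transfer port default moved from 50010 to 9866), so the list should be checked against the target cluster's own configuration.

```python
import socket
import sys

# Ports listed in the post. Whether these match the target cluster's actual
# service ports is an assumption to verify against its configuration.
PORTS = [8020, 1004, 50010, 50070, 1006, 14000]


def check_ports(host, ports, timeout=3.0):
    """Return {port: bool}, True if a TCP connection to host:port succeeded."""
    results = {}
    for port in ports:
        try:
            # create_connection raises OSError on refusal, timeout, or no route
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            results[port] = False
    return results


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_ports.py <target NameNode or DataNode address>
    for port, ok in check_ports(sys.argv[1], PORTS).items():
        print(port, "reachable" if ok else "blocked or closed")
```

Running it against each target DataNode as well as the NameNode would show whether only the NameNode RPC port is getting through, which would be consistent with the zero-byte file being created before the map tasks fail.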