Member since
01-22-2024
1
Post
0
Kudos Received
0
Solutions
01-20-2026
08:38 PM
Hi Team, We are planning to deploy the Iceberg Hive runtime JAR (iceberg-hive-runtime.jar) in our CDP Private Cloud Base 7.1.9 cluster (Cloudera Manager 7.11.3) to enable Hive and Spark to work with Iceberg tables. Iceberg Hive runtime JAR: iceberg-hive-runtime-1.3.1.jar,iceberg-spark-runtime-3.3_2.12-1.3.1.jar step: 1 Upload the JAR to HDFS Create a directory in HDFS: hdfs dfs -mkdir -p /user/share/lib Upload the JAR: hdfs dfs -put iceberg-hive-runtime.jar /user/share/lib/ hdfs dfs -chmod 755 /user/share/lib/iceberg-hive-runtime.jar Configure Hive via Cloudera Manager Navigate to Hive → Configuration → Auxiliary JARs Directory Add: hdfs:///user/share/lib Step 2: Configure Spark via Cloudera Manager Navigate to Spark → Configuration → Spark Client Advanced Configuration Snippet (Safety Valve) Add: spark.yarn.jars hdfs:///user/share/lib/iceberg-hive-runtime.jar Step 3: Restart Services Restart Hive, Spark, and YARN. We would like guidance and confirmation on the steps, any possible impacts, and best practices to ensure a smooth deployment. Are the steps outlined above correct and fully supported for CDP 7.1.9? Are there any additional recommendations we should be aware of? Can you provide guidance on the best way to validate the deployment? Environment Details: CDP Private Cloud Base: 7.1.9 Cloudera Manager: 7.11.3 Hadoop Cluster (metadata connection) Keytab authentication Spark Universal enabled
... View more
Labels:
- Labels:
-
Apache Iceberg