Created 11-26-2025 05:51 AM
Created 12-07-2025 11:30 PM
@Amr5
From the logs you shared the core issue is
FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask
Vertex failed: INIT_FAILURE
Unable to instantiate class with 1 arguments:org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator
Caused by: java.lang.ExceptionInInitializerErrorThis indicates a classpath/library compatibility issue between Informatica and the upgraded Cloudera CDP cluster, specifically with Tez and Hive components.
Root Causes:
Solution 1
Update the Informatica Hadoop connection:
Update Hadoop libraries:
Copy new CDP client libraries to Informatica
Restart Informatica services:
Additionally Identify and copy missing Tez JARs
Update classpathin Informatica domain configuration
Configure Hive Execution Engine If Tez is causing issues, temporarily switch to MapReduce.
In your Hive connection properties, add:
Update this post after the above steps and always share the logs to enable us understand whats happening in your environment.
Happy hadooping
Created 12-09-2025 07:10 AM
@Shelton ,
First Many thanks for sharing your insights with me much appreciated.
we have re run this workflow again from informatica now again with some new details same errors are coming.
Logs:
SEVERE: The Integration Service failed to run the Hive task [Write_test_76ti7ctz]. See the additional error messages for more information.
com.informatica.sdk.dtm.ExecutionException: [[HIVE_1070] The Integration Service failed to run Hive query [Write_test_76ti7ctz_query_3] for task [Write_test_76ti7ctz] due to following error: Hive error code [2], Hive message [Error while compiling statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Map 1, vertexId=vertex_1764836049228_0036_1_00, diagnostics=[Vertex vertex_1764836049228_0036_1_00 [Map 1] killed/failed due to:INIT_FAILURE, Fail to create InputInitializerManager, java.util.concurrent.ExecutionException: org.apache.tez.dag.api.TezReflectionException: Unable to instantiate class with 1 arguments: org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator
2025-12-09 16:44:25,009 AST <LdtmWorkflowTask-pool-6-thread-8> INFO: Could not rename /tmp/sqoop-infadpdev/673817950266090/Fact_Bills_DD.java to /tmp/sqoop-infadpdev/673817950266090/Fact_Bills_DD.java. Error: File element in parameter 'null' already exists: '/tmp/sqoop-infadpdev/673817950266090/Fact_Bills_DD.java'
2025-12-09 16:44:21,704 AST <MappingCompiler-pool-4-thread-3> INFO: [LDTMCMN_0037] The Hadoop distribution directory is defined in Data Integration Service properties at the path [/data1/informatica/dei/services/shared/hadoop/CDH_7.218].
2025-12-09 16:44:21,704 AST <MappingCompiler-pool-4-thread-3> INFO: [CLUSTERCONF_10024] The cluster configuration [Cloudera_Dev] is unchanged from the last export. Using the existing export file [/data1/informatica/dei/tomcat/bin/disTemp/DOM_IDQ_DEV/DIS_DEI_DEV/node02_DEI_DEV/cloudera_dev/SPARK/665d244e-0368-4d51-8ac9-01e1ac851a1f/infacco-site.xml].
2025-12-09 16:44:21,704 AST <MappingCompiler-pool-4-thread-3> INFO: [CLUSTERCONF_10028] Based on the distribution [CLOUDERA] and the run-time engine [SPARK], the Data Integration Service will override the following cluster configuration properties at run time: \n - fs.file.impl.disable.cache: true\n - fs.hdfs.impl.disable.cache: true\n - yarn.timeline-service.enabled: false
--------------------
run this and locate command for hive-exec.jar on informatica server and see that all hive-exec.jars were on older cloudera version which was 7.1.9
sudo find / -type f -name "hive-exec.jar" 2>/dev/null
Unable to instantiate class with 1 arguments: org.apache.hadoop.hive.ql.exec.tez.HiveSplitGenerator is in hive-exec.jar
which is pointing to old path.
Created 12-09-2025 07:56 AM
@Amr5
Just as you too realized there is an old path issue. The error indicates a version mismatch between the Hive/Tez libraries being used by Informatica and those expected by your Cloudera cluster.
RCA
1. Informatica is using Hive libraries from an older Cloudera version (7.1.9)
2. Your cluster is running Cloudera 7.2.18 (as shown in the path /data1/informatica/dei/services/shared/hadoop/CDH_7.218
3. The HiveSplitGenerator class in the old hive-exec.jar is incompatible with the newer Tez runtime
Step 1: Locate Current Hive Libraries
Step 2: Backup Old Libraries
Step 3: Copy Correct Hive Libraries from Cluster
# Find the correct hive-exec.jar on your Cloudera cluster
find /opt/cloudera/parcels -name "hive-exec*.jar" 2>/dev/null
# Copy it to Informatica's Hadoop distribution directory
cp /opt/cloudera/parcels/CDH-7.2.18*/lib/hive/lib/hive-exec-*.jar \
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/
In Informatica Administrator Console:
Step 5: Restart Services
Step 6: Clear Cached Compilation Files
Step 7: Re-run Your Mapping
If you have multiple nodes in your Informatica cluster, repeat Steps 2-3 on all nodes where the Data Integration Service runs.
Happy hadooping
Created on 12-10-2025 02:55 AM - edited 12-10-2025 04:04 AM
@Shelton
after following shared steps mentioned, we re run the mapping getting this error now:
java.lang.NoSuchMethodError:
'org.apache.hadoop.hive.ql.parse.ParseResult org.apache.hadoop.hive.ql.parse.ParseDriver.parse(java.lang.String, org.apache.hadoop.conf.Configuration)'
please also note this that old jars for hive-exec also there in path :
/data1/informatica/dei/services/shared/hadoop/CDH_7.218
data1/informatica/dei/clients/DeveloperClient/hadoop/CDH_7.218/lib/hive-exec-3.1.3000.7.2.18.0-641.jar
/data1/informatica/dei/clients/DeveloperClient/hadoop/CDH_7.218/lib/hive-exec-3.1.3000.7.3.1.100-57.jar
/data1/informatica/dei/clients/DeveloperClient/hadoop/CDH_7.218/spark/jars/hive-exec-3.1.3000.7.2.18.0-641.jar
*****/data1/informatica/dei/clients/DeveloperClient/hadoop/CDH_7.1/lib/hive-exec-3.1.3000.7.1.9.0-387.jar
/data1/informatica/dei/externaljdbcjars/hive-exec-3.1.3000.7.3.1.100-57.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/lib/hive-exec-3.1.3000.7.1.9.0-387.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/lib/hive-exec-3.1.3000.7.2.18.0-641.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/lib/hive-exec-3.1.3000.7.3.1.100-57.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/spark/jars/hive-exec-3.1.3000.7.1.9.0-387.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/spark/jars/hive-exec-3.1.3000.7.2.18.0-641.jar
/data1/informatica/dei/services/shared/hadoop/CDH_7.218/spark/jars/hive-exec-3.1.3000.7.3.1.100-57.jar
*** /data1/informatica/dei/services/shared/hadoop/CDH_7.1/lib/hive-exec-3.1.3000.7.1.9.0-387.jar
*** /data1/informatica/dei/services/shared/hadoop/CDH_7.1/spark/jars/hive-exec-3.1.3000.7.1.9.0-387.jar
/data1/informatica/dei/services/shared/spark/lib_spark/hive-exec-3.1.3000.7.1.9.0-387.jar
/data1/informatica/dei/services/shared/spark/lib_spark/hive-exec-3.1.3000.7.2.18.0-641.jar
/data1/informatica/dei/services/shared/spark/lib_spark/hive-exec-3.1.3000.7.3.1.100-57.jar
Created on 12-10-2025 04:20 AM - edited 12-10-2025 04:21 AM
The NoSuchMethodError means JAR conflict at runtime. You must ensure that only CDH 7.2.18 Hive JARs are in the classpath, with no remnants of 7.1.9.
The ParseDriver.parse() method signature changed between Hive versions. In your case the Old Hive JARs (from CDH 7.1.9) are still present in /data1/informatica/dei/services/shared/hadoop/CDH_7.218
Java is loading the old hive-exec.jar instead of the new one, causing method signature mismatches
Step 1. Identify ALL Old Hive JARs
Step 2: Remove ALL Old Hive JARs
Step 3: Copy ALL Correct Hive JARs from Cloudera Cluster
Step 4: Verify Correct Versions
cd /data1/informatica/dei/services/shared/hadoop/CDH_7.218
ls -lh hive*.jar | head -5
# Check the version inside hive-exec.jar
unzip -p hive-exec-*.jar META-INF/MANIFEST.MF | grep -i version
Step 5: Clear Java Classpath Cache
Step 6: Restart Informatica Services
Step 7: Verify Hadoop Distribution in Informatica Admin Console
Step 8: Re-run Your Mapping
Happy Hadooping