Member since: 12-11-2015
Posts: 200
Kudos Received: 29
Solutions: 30
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 368 | 08-14-2024 06:24 AM
 | 1347 | 10-02-2023 06:26 AM
 | 1236 | 07-28-2023 06:28 AM
 | 7722 | 06-02-2023 06:06 AM
 | 614 | 01-09-2023 12:20 PM
03-16-2023
03:11 PM
@Me Sorry for the confusion; I see what you mean now. Per https://impala.apache.org/docs/build/html/topics/impala_perf_stats.html#perf_stats_incremental:

"COMPUTE INCREMENTAL STATS: In Impala 2.1.0 and higher, you can use the COMPUTE INCREMENTAL STATS and DROP INCREMENTAL STATS commands. The INCREMENTAL clauses work with incremental statistics, a specialized feature for partitioned tables. When you compute incremental statistics for a partitioned table, by default Impala only processes those partitions that do not yet have incremental statistics. By processing only newly added partitions, you can keep statistics up to date without incurring the overhead of reprocessing the entire table each time."

So the drop statistics step is intended for COMPUTE INCREMENTAL STATS, and not for COMPUTE INCREMENTAL STATS with a PARTITION clause.

May I know which version of CDP you are using, so that I can test on my end and confirm with you?
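In the meantime, here is a minimal sketch of the two COMPUTE forms being discussed, run through impala-shell (the table name test_part and the partition b=1 are placeholders, not from your environment):

# Whole-table form: processes only partitions that do not yet have incremental stats
impala-shell -q "COMPUTE INCREMENTAL STATS test_part;"
# Per-partition form: limits the work to the named partition
impala-shell -q "COMPUTE INCREMENTAL STATS test_part PARTITION (b=1);"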
03-14-2023
09:14 AM
Hi,

This statement in the doc, "In cases where new files are added to an existing partition, issue a REFRESH statement for the table, followed by a DROP INCREMENTAL STATS and COMPUTE INCREMENTAL STATS sequence for the changed partition.", applies specifically to a partition for which stats are already available but to which you have added more data.

If you are unsure whether stats exist for a partition, you can run show table stats <table_name>; and check the "Incremental stats" column.

Query: show table stats test_part
+-------+-------+--------+------+--------------+-------------------+--------+-------------------+--------------------------------------------------------------------------+
| b | #Rows | #Files | Size | Bytes Cached | Cache Replication | Format | Incremental stats | Location |
+-------+-------+--------+------+--------------+-------------------+--------+-------------------+--------------------------------------------------------------------------+
| 1 | 0 | 1 | 0B | NOT CACHED | NOT CACHED | TEXT | false | hdfs://xxxx:8020/user/hive/warehouse/test_part/b=1 |
| Total | -1 | 1 | 0B | 0B | | | | |
+-------+-------+--------+------+--------------+-------------------+--------+-------------------+--------------------------------------------------------------------------+
Fetched 2 row(s) in 5.60s

If it shows false, you can run COMPUTE INCREMENTAL STATS with the PARTITION clause directly. If it shows true and you have added more data to that partition, then you have to drop the stats first and then run COMPUTE INCREMENTAL STATS with the PARTITION clause, as sketched below.
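A minimal sketch of that sequence via impala-shell, using the test_part example above (partition b=1; the REFRESH step is included per the doc statement quoted earlier):

# Pick up the newly added files in the partition
impala-shell -q "REFRESH test_part;"
# Drop the existing incremental stats for the changed partition
impala-shell -q "DROP INCREMENTAL STATS test_part PARTITION (b=1);"
# Recompute incremental stats for just that partition
impala-shell -q "COMPUTE INCREMENTAL STATS test_part PARTITION (b=1);"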
02-22-2023
12:12 PM
Hi. Yes, this is expected when multiple users share a common path for the TGT cache. Can you make the location unique for each user? I haven't tested it, but I see an option described in this link: https://gpdb.docs.pivotal.io/6-3/admin_guide/kerberos-win-client.html

"Set up the Kerberos credential cache file. On the Windows system, set the environment variable KRB5CCNAME to specify the file system location of the cache file. The file must be named krb5cache. This location identifies a file, not a directory, and should be unique to each login on the server. When you set KRB5CCNAME, you can specify the value in either a local user environment or within a session. For example, the following command sets KRB5CCNAME in the session:

set KRB5CCNAME=%USERPROFILE%\krb5cache"
01-09-2023
12:20 PM
2 Kudos
You can set a quota on /tmp. Once the quota is reached, further writes to the directory will fail. https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/scaling-namespaces/topics/hdfs-set-quotas-cm.html has the steps to enable quotas in Cloudera Manager; a command-line sketch is below.
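If you prefer the command line over CM, a minimal sketch using hdfs dfsadmin (the 10g space quota and the 100000 name quota are placeholder values, not recommendations):

# Cap the total disk space consumed under /tmp (the space quota counts replicated bytes)
hdfs dfsadmin -setSpaceQuota 10g /tmp
# Optionally cap the number of files and directories under /tmp
hdfs dfsadmin -setQuota 100000 /tmp
# Check the configured quotas and current usage
hdfs dfs -count -q -h /tmp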
12-12-2022
08:10 AM
The JIRA HADOOP-9640 added a feature that gives well-behaved users (those not hogging the NameNode RPC queue) fair response times. An explanation of how this works is available in this video: https://www.youtube.com/watch?v=7Axz3bO18l8&ab_channel=DataWorksSummit
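For reference, a minimal sketch of how this is typically enabled; the NameNode RPC port 8020 is an assumption (substitute your NameNode's port), and the two properties below go into core-site.xml:

# core-site.xml entries to switch the NameNode call queue to FairCallQueue
#   ipc.8020.callqueue.impl = org.apache.hadoop.ipc.FairCallQueue
#   ipc.8020.scheduler.impl = org.apache.hadoop.ipc.DecayRpcScheduler
# After updating the configuration, reload the call queue without a full restart
hdfs dfsadmin -refreshCallQueue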
04-26-2020
11:17 PM
Can you share the exact steps / list of configuration changes you made to configure Kerberos in Kafka? At the time of the broker failure, what is the exact error you notice on the ZooKeeper side?

4:29:51.371 AM ERROR ZooKeeperClient [ZooKeeperClient] Auth failed.

Did you tweak any configuration on ZooKeeper too?
04-01-2020
11:19 PM
Hi @Amn_468 Please configure it as follows:
1. Go to CM > HDFS > Configuration > Java Heap Size of NameNode in Bytes
2. Enter a value per your requirement
3. Save and restart
03-31-2020
09:50 AM
Are there any errors in the JHS logs, especially around this timeframe: 2020-03-31 13:14:* ?
03-27-2020
09:37 AM
The call to this region server, 1.1.1.1:60020, is getting closed instantly:

Caused by: org.apache.hadoop.hbase.exceptions.ConnectionClosingException:
Call to hostname003.enterprisenet.org/1.1.1.1:60020 failed on local exception: org.apache.hadoop.hbase.exceptions.ConnectionClosingException: Connection to hostname003.enterprisenet.org/1.1.1.1:60020 is closing. Call id=4045, waitTime=2

1. Is there an hbase-site.xml bundled with your application jar?
2. If yes, can you rebuild the jar with the latest hbase-site.xml from /etc/hbase/conf/?
3. I am not sure if the server is printing any ERROR, but it is worth checking what exactly is happening in the RS logs on node hostname003.enterprisenet.org at the time 2020 Mar 27 01:18:16 (i.e., when the connection from the client is closed).
03-27-2020
02:15 AM
Can you attach the full exception or the error log? It's unclear what the actual error is from the snippet you pinged in your last response.