Member since: 03-11-2020
197 Posts
30 Kudos Received
40 Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 2133 | 11-07-2024 08:47 AM
 | 1504 | 11-07-2024 08:36 AM
 | 1052 | 06-18-2024 01:34 AM
 | 727 | 06-18-2024 01:25 AM
 | 887 | 06-18-2024 01:16 AM
04-12-2023
02:07 AM
@aleezeh Let us know if you are also getting the error "unable to find valid certification path to requested target". If so, the Kafka truststore may be missing the Ranger Admin certificate; you can import the Ranger Admin certificate into the Kafka truststore:

keytool -importcert -file /tmp/ranger.cer -keystore kafka_plugin_truststore.jks

If you found that the provided solution(s) assisted you with your query, please take a moment to log in and click "Accept as Solution" below each response that helped.
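As a quick sanity check after the import, you can list the truststore contents and look for the newly added entry (this assumes the same truststore path as above; you will be prompted for the truststore password):

# List the entries currently in the Kafka plugin truststore
keytool -list -v -keystore kafka_plugin_truststore.jks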
04-12-2023
01:59 AM
@aleezeh Can you please attach the log file so we can investigate further? Which version of HDP or CDP are you using?
04-12-2023
01:48 AM
@rajilion It seems that you are using the -update flag with the distcp command, which causes it to skip files that already exist in the destination with a modification time equal to or newer than the source file. This is the expected behavior of distcp when -update is used. In your case, even though the content of the file has changed, its size and modification time are the same, so distcp skips the file during the copy.

To copy the updated file to S3, try removing the -update flag from the distcp command. This forces distcp to copy all files from the source directory to the destination, whether or not they already exist there. Your updated command would look like this:

hadoop distcp -pu -delete hdfs_path s3a://bucket

The -pu flag preserves the user ownership of the files during the copy.

Please note that without the -update flag, distcp copies all files from the source directory to the destination, even those that have not been modified. This can be time-consuming and may result in unnecessary data transfer costs if you have a large number of files. If you only want to copy specific files that have changed, you can use a different tool such as s3-dist-cp or aws s3 sync, which perform incremental copies using their own change detection rather than relying only on modification times or file sizes.

If you found that the provided solution(s) assisted you with your query, please take a moment to log in and click "Accept as Solution" below each response that helped.
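If only a handful of files actually changed, you can also target them directly instead of re-copying the whole directory. A minimal sketch, where the HDFS path and bucket name are placeholders for your own:

# Copy a single changed file and overwrite whatever is already in the destination
hadoop distcp -overwrite hdfs:///data/reports/daily.csv s3a://my-bucket/reports/daily.csv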
04-12-2023
01:44 AM
You should be able to access a cluster using the link below. https://www.cloudera.com/campaign/try-cdp-public-cloud.html#:~:text=Try%20CDP%20Public%20Cloud%20for,hybrid%20and%20multi%2Dcloud%20data If you found that the provided solution(s) assisted you with your query, please take a moment to log in and click "Accept as Solution" below each response that helped.
03-30-2023
04:11 AM
@jjjjanine Thank you for providing your valuable input.
03-27-2023
09:27 PM
I have checked internally; this is not currently supported for CDP, but we have an internal Jira to test and certify it in the future.
01-17-2023
12:01 AM
@fernando_lopez I'm checking on this internally and will keep you posted.
01-16-2023
11:50 PM
1 Kudo
Spark, YARN, and Hive jobs use local directories for localization: the data an application needs is stored in these local directories while the application is running. Please manually delete older, unwanted data from the local /tmp directory, and follow the steps in the following article to clear the local file cache and user cache for YARN: https://community.cloudera.com/t5/Community-Articles/How-to-clear-local-file-cache-and-user-cache-for-yarn/ta-p/245160

If you found that the provided solution(s) assisted you with your query, please take a moment to log in and click "Accept as Solution" below each response that helped.
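For the manual /tmp cleanup, something like the following can remove files that have not been modified recently. The 7-day threshold is just an example; confirm nothing under /tmp is still in use by running jobs before deleting:

# Delete files under /tmp not modified in the last 7 days (example threshold)
find /tmp -type f -mtime +7 -delete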
01-16-2023
11:44 PM
@techfriend This can be resolved by modifying the principal. The failure in the log looks like this:

WARNING: no policy specified for mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU; defaulting to no policy
add_principal: Principal or policy already exists while creating "mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU".
+ '[' 604800 -gt 0 ']'
++ kadmin -k -t /var/run/cloudera-scm-server/cmf5922922234613877041.keytab -p cloudera-scm/admin@HADM.RU -r HADM.RU -q 'getprinc -terse mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU'
++ tail -1
++ cut -f 12
+ RENEW_LIFETIME=0
+ '[' 0 -eq 0 ']'
+ echo 'Unable to set maxrenewlife'
+ exit 1

To fix it, modify the principal so its tickets are renewable:

modprinc -maxrenewlife 90day +allow_renewable mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU
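A minimal sketch of running that non-interactively on the KDC host and then confirming the change (kadmin.local is assumed to be available there; use whichever admin access your environment provides):

# Apply the renewable lifetime to the principal
kadmin.local -q 'modprinc -maxrenewlife 90day +allow_renewable mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU'
# Verify that Maximum renewable life is now set
kadmin.local -q 'getprinc mapred/ip-172-31-46-169.us-west-2.compute.internal@HADM.RU'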
01-10-2023
05:47 AM
To resolve this issue, open /opt/cloudera/cm/bin/gen_credentials.sh for editing and add the following line at the very end of the script:

exit ${PIPESTATUS[0]}

Then try generating the Kerberos credentials once more.
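For context, ${PIPESTATUS[0]} reports the exit status of the first command in the most recent pipeline, which a plain $? does not. A small bash illustration (not part of the script itself):

# In bash, $? reflects only the last command of a pipeline:
false | tee /dev/null
echo "${PIPESTATUS[0]} $?"   # prints "1 0" -- PIPESTATUS[0] keeps the failure, $? does not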