About jagadeesan

jagadeesan · ‎04-26-2024

Flume, Storm, Druid, Falcon, Mahout, Ambari, Pig, Sentry, and Navigator have changed or been removed in CDP with replaced components . For Storm can be replaced with Cloudera Streaming Analytics (CSA) powered by Apache Flink. Contact your Cloudera account team for more information about moving from Storm to CSA. You can refer comparing Storm and Flink also Migrating from Storm to Flink.

DianaTorres · ‎04-22-2024

@Alaaeldin Has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future. Thanks.

skasireddy · ‎04-24-2023

Sorry for late response, I use oozie to submit a spark job

AsimShaikh · ‎12-02-2022

@sss123 Are you able to run spark commands via spark-shell spark-submit?

smdas · ‎12-01-2022

Hello @QiDam As stated by JD above, the CDE Service relies on Ozone Service on Base Cluster. If Ozone Service wasn't in Healthy State, the CDE Service enabling would fail with similar tracing as shared by you. We would recommend the following checks: Ensure Ozone Service is Up & Running on Base Cluster. Create a new Environment & check CDE Service enabling on the new Environment. If the above CDE Service enabling on new Environment is Successful, Reattempt the CDE Service enabling on the existing Environment. If the above Suggestion doesn't help, We would suggest engaging Support as any further troubleshooting would require sharing the Logs over the Public Community forum, which may have Customer's details. We shall mark the Post as Resolved now. If you have any concerns, Feel free to Update the Post likewise. Regards, Smarak

nur.majid · ‎11-01-2022

Hi @Siddu198 Add this config to your job: set("mapreduce.fileoutputcommitter.algorithm.version","2")

jeromedruais · ‎09-26-2022

Hello @jagadeesan , @rki_ parameters you mentioned do not appear in Ambari. Does that mean our clusters are running with the default settings, exposing the clusters to the vulnerability ? Please, could you provide the way to set this parameters (which custom settings for Spark 1 and Spark 2 as well as the keys and values). Thanks in advance.

RangaReddy · ‎08-31-2022

Hi @nvelraj Pyspark job working locally because in your local system pandas library is installed, so it is working. When you run in cluster, pandas library/module is not available so you are getting the following error. ModuleNotFoundError: No module named 'pandas' To solve the. issue, you need to install the pandal library/module in all machines or use Virtual environment.

paulo_klein · ‎08-12-2022

To solve "unable to find valid certification path to requested target" I just import the certificate to java and restart the Zeppeling Server. ### LINUX LIST CERT cd /usr/lib/jvm/java-11-openjdk-11.0.15.0.10-2.el8_6.x86_64/bin ./keytool -list -keystore /usr/lib/jvm/java-11-openjdk-11.0.15.0.10-2.el8_6.x86_64/lib/security/cacerts ### LINUX IMPORT CERT ./keytool --import --alias keystore_cloudera --file /var/lib/cloudera-scm-agent/agent-cert/cm-auto-global_cacerts.pem -keystore /usr/lib/jvm/java-11-openjdk-11.0.15.0.10-2.el8_6.x86_64/lib/security/cacerts

jagadeesan · ‎08-02-2022

@Asim- JDBC also you need HWC for Managed tables. Here is the example for Spark2, but as mentioned earlier Spark3 we don't have any other way to connect Hive ACID tables from Apache Spark other than HWC and it is not yet a supported feature for Spark3.2 / CDS 3.2 in CDP 7.1.7. Marking this thread close, if you have any issues related to external tables kindly start a new Support-Questions thread for better tracking of the issue and documentation. Thanks

Online	Offline
Last Visited	‎02-25-2025 03:13 PM

Member Since	‎11-12-2018 10:00 AM
Last Visited	‎02-25-2025 03:13 PM
Posts	202
Kudos received	177

Cloudera Community

Re: Apache Storm support in Cloudera

Re: Complete example for using spark MLlib for twi...

Re: CDP - Zeppeling: Spark + Livy + Hive - HWC

Re: CDP - Zeppelin - Livy Error

Re: Spark3 connection to HIVE ACID Tables

Re: Apache Storm support in Cloudera

Re: Complete example for using spark MLlib for twi...

Re: Unable to connect remote Hadoop cluster using ...

Re: The Spark session could not be created in the ...

Re: Enabling of CDE service failed

Re: How to change Spark _temporary directory when ...

Re: CVE-2022-33891

Re: pyspark toPandas() works locally but fails in ...

Re: CDP - Zeppeling: Spark + Livy + Hive - HWC

Re: Spark3 connection to HIVE ACID Tables