Member since: 01-05-2016
Posts: 55
Kudos Received: 37
Solutions: 6
My Accepted Solutions
| Title | Views | Posted |
| --- | --- | --- |
| | 271 | 10-21-2019 05:16 AM |
| | 2837 | 01-29-2018 07:05 AM |
| | 1798 | 06-27-2017 06:42 AM |
| | 33366 | 05-26-2016 04:05 AM |
| | 21736 | 05-17-2016 02:15 PM |
02-15-2021
12:08 PM
Good day,
Effective January 31, 2021, all Cloudera software requires a valid subscription and is only accessible from behind the paywall. This includes all legacy versions of Cloudera's Distribution including Apache Hadoop (CDH), Hortonworks Data Platform (HDP), Data Flow (HDF/CDF), and Cloudera Data Science Workbench (CDSW). Information regarding paywall access will be available in the technical documentation by software type and version:
https://www.cloudera.com/downloads/paywall-expansion.html
If you have a valid Cloudera subscription, you can obtain your download credentials by following the directions outlined here:
https://docs.cloudera.com/cdp-private-cloud-base/latest/installation/topics/cdpdc-cm-download-information.html
11-06-2020
08:45 AM
Since no one has responded, I'm following up on this: it is a defect, and it is addressed via HUE-9110.
10-29-2020
08:49 AM
Hi, I have a Cloudera Express 6.3.2 setup where I'm trying to submit an Oozie Spark Action that reads from an HBase table (correctly mapped in Hive). If I open the PySpark CLI and add the relevant "--jars" when calling it, I can read from the table without any problems. The list of jars I'm adding is the following:
hive-hbase-handler-2.1.1-cdh6.3.2.jar
hbase-client-2.1.0-cdh6.3.2.jar
guava-11.0.2.jar
hbase-common-2.1.0-cdh6.3.2.jar
hbase-hadoop-compat-2.1.0-cdh6.3.2.jar
hbase-hadoop2-compat-2.1.0-cdh6.3.2.jar
hbase-protocol-2.1.0-cdh6.3.2.jar
hbase-server-2.1.0-cdh6.3.2.jar
htrace-core4-4.2.0-incubating.jar
But if I try to run the same script via Oozie + Spark Action, I get the exception reported below. The exact same thing works on a CDH 5 setup, but nevertheless I've tried several things hoping to make it work here too, to no avail.
What I've tried:
------------------
- Adding a "hive.aux.jars.path" section to BOTH the "hive-site.xml" and "hbase-site.xml" that I'm passing to my Spark Action with "--files"
- Configuring the Oozie Sharelib and putting the jar files under BOTH the "spark" and "hive" HDFS directories of the Sharelib (and of course setting "oozie.use.system.libpath" to "true" in my Oozie Workflow Configuration)
- Setting an "oozie.libpath" option in the same way as the previous step, pointing to a complete list of the involved HBase jars (I then removed this config, as I was getting an error telling me the relevant jars could not be loaded multiple times because they are already present in the Sharelib)
- Passing a relevant "SPARK_CLASSPATH" along with a "SPARK_HOME" as "oozie.launcher.yarn.app.mapreduce.am.env" Hadoop properties in my Oozie Workflow Configuration, with the list of all the jars involved
- Adding "--conf spark.driver.extraClassPath=..." and "--conf spark.executor.extraClassPath=..." configuration options, BOTH inside the Python script being called and in my Oozie Workflow Spark Action window in the Hue GUI (a sketch of the in-script variant is shown after the stack trace below)
I've extensively searched the Community Forum and the web before posting this, but no joy. Also, this is the first time this is happening; I have other setups where this works as expected. I don't know what to try anymore, and any help would be greatly appreciated. Here below I'm posting the full stack trace. Thank you for any insights!
Log Type: stdout
Log Upload Time: Thu Oct 29 15:36:22 +0100 2020
Log Length: 9563
Traceback (most recent call last):
File "reportServicesCreditExtraction.py", line 74, in <module>
hbase_utenti_DF = sqlContext.table("msgnet.hbase_utenti")
File "/data/1/yarn/nm/usercache/msgnet/appcache/application_1602174532153_0765/container_1602174532153_0765_02_000001/pyspark.zip/pyspark/sql/context.py", line 371, in table
File "/data/1/yarn/nm/usercache/msgnet/appcache/application_1602174532153_0765/container_1602174532153_0765_02_000001/pyspark.zip/pyspark/sql/session.py", line 791, in table
File "/data/1/yarn/nm/usercache/msgnet/appcache/application_1602174532153_0765/container_1602174532153_0765_02_000001/py4j-0.10.7-src.zip/py4j/java_gateway.py", line 1257, in __call__
File "/data/1/yarn/nm/usercache/msgnet/appcache/application_1602174532153_0765/container_1602174532153_0765_02_000001/pyspark.zip/pyspark/sql/utils.py", line 63, in deco
File "/data/1/yarn/nm/usercache/msgnet/appcache/application_1602174532153_0765/container_1602174532153_0765_02_000001/py4j-0.10.7-src.zip/py4j/protocol.py", line 328, in get_return_value
py4j.protocol.Py4JJavaError: An error occurred while calling o86.table.
: java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/mapreduce/TableInputFormatBase
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:756)
at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:468)
at java.net.URLClassLoader.access$100(URLClassLoader.java:74)
at java.net.URLClassLoader$1.run(URLClassLoader.java:369)
at java.net.URLClassLoader$1.run(URLClassLoader.java:363)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:362)
at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
at org.apache.spark.sql.hive.client.IsolatedClientLoader$$anon$1.doLoadClass(IsolatedClientLoader.scala:246)
at org.apache.spark.sql.hive.client.IsolatedClientLoader$$anon$1.loadClass(IsolatedClientLoader.scala:235)
at java.lang.ClassLoader.loadClass(ClassLoader.java:351)
at org.apache.hadoop.hive.hbase.HBaseStorageHandler.getInputFormatClass(HBaseStorageHandler.java:133)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7$$anonfun$12$$anonfun$apply$10.apply(HiveClientImpl.scala:463)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7$$anonfun$12$$anonfun$apply$10.apply(HiveClientImpl.scala:463)
at scala.Option.map(Option.scala:146)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7$$anonfun$12.apply(HiveClientImpl.scala:463)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7$$anonfun$12.apply(HiveClientImpl.scala:463)
at scala.Option.orElse(Option.scala:289)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7.apply(HiveClientImpl.scala:462)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1$$anonfun$apply$7.apply(HiveClientImpl.scala:376)
at scala.Option.map(Option.scala:146)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1.apply(HiveClientImpl.scala:376)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$getTableOption$1.apply(HiveClientImpl.scala:374)
at org.apache.spark.sql.hive.client.HiveClientImpl$$anonfun$withHiveState$1.apply(HiveClientImpl.scala:283)
at org.apache.spark.sql.hive.client.HiveClientImpl.liftedTree1$1(HiveClientImpl.scala:221)
at org.apache.spark.sql.hive.client.HiveClientImpl.retryLocked(HiveClientImpl.scala:220)
at org.apache.spark.sql.hive.client.HiveClientImpl.withHiveState(HiveClientImpl.scala:266)
at org.apache.spark.sql.hive.client.HiveClientImpl.getTableOption(HiveClientImpl.scala:374)
at org.apache.spark.sql.hive.client.HiveClient$class.getTable(HiveClient.scala:81)
at org.apache.spark.sql.hive.client.HiveClientImpl.getTable(HiveClientImpl.scala:84)
at org.apache.spark.sql.hive.HiveExternalCatalog.getRawTable(HiveExternalCatalog.scala:120)
at org.apache.spark.sql.hive.HiveExternalCatalog$$anonfun$getTable$1.apply(HiveExternalCatalog.scala:737)
at org.apache.spark.sql.hive.HiveExternalCatalog$$anonfun$getTable$1.apply(HiveExternalCatalog.scala:737)
at org.apache.spark.sql.hive.HiveExternalCatalog.withClient(HiveExternalCatalog.scala:99)
at org.apache.spark.sql.hive.HiveExternalCatalog.getTable(HiveExternalCatalog.scala:736)
at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.getTable(ExternalCatalogWithListener.scala:146)
at org.apache.spark.sql.catalyst.catalog.SessionCatalog.lookupRelation(SessionCatalog.scala:701)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.org$apache$spark$sql$catalyst$analysis$Analyzer$ResolveRelations$$lookupTableFromCatalog(Analyzer.scala:730)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.resolveRelation(Analyzer.scala:685)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$$anonfun$apply$8.applyOrElse(Analyzer.scala:715)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$$anonfun$apply$8.applyOrElse(Analyzer.scala:708)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1$$anonfun$apply$1.apply(AnalysisHelper.scala:90)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1$$anonfun$apply$1.apply(AnalysisHelper.scala:90)
at org.apache.spark.sql.catalyst.trees.CurrentOrigin$.withOrigin(TreeNode.scala:70)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1.apply(AnalysisHelper.scala:89)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1.apply(AnalysisHelper.scala:86)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$.allowInvokingTransformsInAnalyzer(AnalysisHelper.scala:194)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$class.resolveOperatorsUp(AnalysisHelper.scala:86)
at org.apache.spark.sql.catalyst.plans.logical.LogicalPlan.resolveOperatorsUp(LogicalPlan.scala:29)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.apply(Analyzer.scala:708)
at org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveRelations$.apply(Analyzer.scala:654)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1$$anonfun$apply$1.apply(RuleExecutor.scala:87)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1$$anonfun$apply$1.apply(RuleExecutor.scala:84)
at scala.collection.LinearSeqOptimized$class.foldLeft(LinearSeqOptimized.scala:124)
at scala.collection.immutable.List.foldLeft(List.scala:84)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1.apply(RuleExecutor.scala:84)
at org.apache.spark.sql.catalyst.rules.RuleExecutor$$anonfun$execute$1.apply(RuleExecutor.scala:76)
at scala.collection.immutable.List.foreach(List.scala:392)
at org.apache.spark.sql.catalyst.rules.RuleExecutor.execute(RuleExecutor.scala:76)
at org.apache.spark.sql.catalyst.analysis.Analyzer.org$apache$spark$sql$catalyst$analysis$Analyzer$$executeSameContext(Analyzer.scala:127)
at org.apache.spark.sql.catalyst.analysis.Analyzer.execute(Analyzer.scala:121)
at org.apache.spark.sql.catalyst.analysis.Analyzer$$anonfun$executeAndCheck$1.apply(Analyzer.scala:106)
at org.apache.spark.sql.catalyst.analysis.Analyzer$$anonfun$executeAndCheck$1.apply(Analyzer.scala:105)
at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$.markInAnalyzer(AnalysisHelper.scala:201)
at org.apache.spark.sql.catalyst.analysis.Analyzer.executeAndCheck(Analyzer.scala:105)
at org.apache.spark.sql.execution.QueryExecution.analyzed$lzycompute(QueryExecution.scala:57)
at org.apache.spark.sql.execution.QueryExecution.analyzed(QueryExecution.scala:55)
at org.apache.spark.sql.execution.QueryExecution.assertAnalyzed(QueryExecution.scala:47)
at org.apache.spark.sql.Dataset$.ofRows(Dataset.scala:78)
at org.apache.spark.sql.SparkSession.table(SparkSession.scala:637)
at org.apache.spark.sql.SparkSession.table(SparkSession.scala:633)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)
at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)
at py4j.Gateway.invoke(Gateway.java:282)
at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)
at py4j.commands.CallCommand.execute(CallCommand.java:79)
at py4j.GatewayConnection.run(GatewayConnection.java:238)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.hbase.mapreduce.TableInputFormatBase
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
at org.apache.spark.sql.hive.client.IsolatedClientLoader$$anon$1.doLoadClass(IsolatedClientLoader.scala:255)
at org.apache.spark.sql.hive.client.IsolatedClientLoader$$anon$1.loadClass(IsolatedClientLoader.scala:235)
at java.lang.ClassLoader.loadClass(ClassLoader.java:351)
... 84 more
15:36:20.966 [Driver] ERROR org.apache.spark.deploy.yarn.ApplicationMaster - User application exited with status 1
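As mentioned above, this is roughly what the in-script variant of the classpath attempt looked like (a sketch only; the jar paths are placeholders and the list is abbreviated):

from pyspark import SparkConf, SparkContext
from pyspark.sql import HiveContext

# Sketch of setting the classpath options from inside the Python script
# (one of the attempts listed above); jar locations are placeholders.
extra_cp = ":".join([
    "/path/to/hive-hbase-handler-2.1.1-cdh6.3.2.jar",
    "/path/to/hbase-client-2.1.0-cdh6.3.2.jar",
    # ... remaining HBase/htrace jars from the list above ...
])

conf = (SparkConf()
        .setAppName("reportServicesCreditExtraction")
        .set("spark.driver.extraClassPath", extra_cp)
        .set("spark.executor.extraClassPath", extra_cp))
sc = SparkContext(conf=conf)
sqlContext = HiveContext(sc)
hbase_utenti_DF = sqlContext.table("msgnet.hbase_utenti")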
05-22-2020
09:31 AM
Hi all, I have the following situation that I can't solve even after extensive searching and trials. I have 2 clusters: "Cluster 1" is Hortonworks HDP 2.6.5.0 and "Cluster 2" is Cloudera CDH 5.13.1 Enterprise. The final goal is to run a PySpark script on "Cluster 1" and remotely create a Hive table on "Cluster 2". The script (of course this is just an example to illustrate the issue) is the following:
from pyspark import SparkConf, SparkContext
from pyspark.sql import SQLContext
from pyspark.sql import HiveContext
from pyspark.sql.functions import *
from pyspark.sql.types import *
import pyspark.sql.functions as func
sconf = SparkConf().setAppName("rob_example")
sc = SparkContext(conf=sconf)
sqlContext = HiveContext(sc)
sqlContext.setConf("fs.defaultFS","hdfs://<REMOTE_HDFS_HOST>:8020")
sqlContext.sql("create table rob.test_output_table as select * from rob.test_input_table")
Now, what happens is that if I use Spark 1.6.3 on "Cluster 1", the above script runs just fine, and if I log in to Hive on "Cluster 2" I can see that the destination table and data are created correctly. But if I use Spark 2.3 on "Cluster 1" (and I need to use 2.3) I get the following exception instead:
pyspark.sql.utils.AnalysisException: u'java.lang.IllegalArgumentException: Wrong FS: hdfs://<REMOTE_HDFS_HOST>/user/hive/warehouse/rob.db/test_output_table/.hive-staging_hive_2020-05-22_17-59-00_296_5401117384982980379-1/-ext-10000/part-00000-733a2646-fc76-47f0-80d6-a14b28677f7e-c000, expected: hdfs://cluster1_host1.domain1.local:8020;'
First of all, I notice a suspicious ";" trailing the "expected" filesystem where Spark is trying to read/write (maybe it's pointing to the wrong Hive Metastore?). It's also strange that it expects to read/write on "Cluster 1" itself, which differs from what I specified in my Spark session's Hive Context configuration (the fs.defaultFS parameter inside the script). On top of this, even if I set the following parameters I can't make it work. And it's strange, because as I said, with Spark 1.6 everything runs smoothly even without these additional configurations:
sqlContext.setConf("default.fs.name","hdfs://<REMOTE_HDFS_NODE>:8020")
sqlContext.setConf("hive.metastore.uris","thrift://<REMOTE_THRIFT_NODE>:9083")
Also please note that a pure Spark DataFrames approach such as the following would work, but I need to use Spark SQL, otherwise the output table won't be in a Hive-compatible format:
test_DF = sqlContext.sql("select * from rob.test_input_table")
test_DF.write.mode("overwrite").saveAsTable("rob.test_output_table")
So, this last approach is not feasible (the output table is not in a Hive-compatible format). I found a lot of people having similar issues, but none of the cases I found applied exactly to mine, and I'm stuck at this point. Any help/hints would be greatly appreciated! Thank you for any insights on this.
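For completeness, here is a sketch of expressing the same options at session-creation time on Spark 2.x (via the SparkSession builder) instead of via setConf afterwards; this is only a sketch with placeholder host names, not something I have verified:

from pyspark.sql import SparkSession

# Sketch only: same fs.defaultFS / metastore settings, but applied when the session is built.
spark = (SparkSession.builder
         .appName("rob_example")
         .config("spark.hadoop.fs.defaultFS", "hdfs://<REMOTE_HDFS_HOST>:8020")
         .config("hive.metastore.uris", "thrift://<REMOTE_THRIFT_NODE>:9083")
         .enableHiveSupport()
         .getOrCreate())
spark.sql("create table rob.test_output_table as select * from rob.test_input_table")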
10-21-2019
05:16 AM
1 Kudo
You can query the API exposed by Cloudera Manager and simplify your life. For example, you can run the following:
curl -u <CM_USER>:<CM_PASSWD> http://<CM_IP_ADDRESS>:7180/api/v19/clusters/<CLUSTER_NAME>/services/hive2
You'll get a JSON answer in reply to your query, with all the details related to the desired service's status. You can then parse the JSON (e.g. using "jq" or directly inside your bash script) and take the desired actions.
HTH
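If you prefer doing the parsing in Python rather than jq, here is a rough sketch of the same idea (Python 2 style; the placeholders are the same as in the curl example, and field names such as "serviceState" and "healthSummary" may vary between API versions):

import base64
import json
import urllib2

# Sketch only: call the CM API and print the status fields from the JSON answer.
url = "http://<CM_IP_ADDRESS>:7180/api/v19/clusters/<CLUSTER_NAME>/services/hive2"
req = urllib2.Request(url)
req.add_header("Authorization", "Basic " + base64.b64encode("<CM_USER>:<CM_PASSWD>"))
service = json.loads(urllib2.urlopen(req).read())
print(service.get("serviceState"), service.get("healthSummary"))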
03-29-2019
06:07 AM
Dear All, I am facing an issue with Oozie while running a simple job from the Hue GUI, and I'm getting the error below. Please help me!
Error:
"traceback": [
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/build/env/lib/python2.7/site-packages/Django-1.6.10-py2.7.egg/django/core/handlers/base.py", 112, "get_response", "response = wrapped_callback(request, *callback_args, **callback_kwargs)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/build/env/lib/python2.7/site-packages/Django-1.6.10-py2.7.egg/django/db/transaction.py", 371, "inner", "return func(*args, **kwargs)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/apps/oozie/src/oozie/decorators.py", 113, "decorate", "return view_func(request, *args, **kwargs)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/apps/oozie/src/oozie/decorators.py", 75, "decorate", "return view_func(request, *args, **kwargs)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/apps/oozie/src/oozie/views/editor2.py", 373, "submit_workflow", "return _submit_workflow_helper(request, workflow, submit_action=reverse('oozie:editor_submit_workflow', kwargs={'doc_id': workflow.id}))" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/apps/oozie/src/oozie/views/editor2.py", 428, "_submit_workflow_helper", "'is_oozie_mail_enabled': _is_oozie_mail_enabled(request.user)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/apps/oozie/src/oozie/views/editor2.py", 435, "_is_oozie_mail_enabled", "oozie_conf = api.get_configuration()" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/desktop/libs/liboozie/src/liboozie/oozie_api.py", 319, "get_configuration", "resp = self._root.get('admin/configuration', params)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/desktop/core/src/desktop/lib/rest/resource.py", 100, "get", "return self.invoke(\"GET\", relpath, params, headers=headers, allow_redirects=True, clear_cookies=clear_cookies)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/desktop/core/src/desktop/lib/rest/resource.py", 80, "invoke", "clear_cookies=clear_cookies)" ],
  [ "/opt/cloudera/parcels/CDH-5.11.1-1.cdh5.11.1.p0.4/lib/hue/desktop/core/src/desktop/lib/rest/http_client.py", 196, "execute", "raise self._exc_class(ex)" ]
]
}
Thanks, HadoopHelp
11-12-2018
12:09 PM
Hi, this is a late reply, but I hope it can still be useful. To achieve what you want, you should do something like:
dataframe2 = dataframe1.persist(StorageLevel.MEMORY_AND_DISK)
HTH
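A slightly more complete sketch (the table name is just a placeholder, and the import is the part that's easy to miss):

from pyspark import SparkConf, SparkContext, StorageLevel
from pyspark.sql import HiveContext

# Minimal sketch: persist a DataFrame to memory, spilling to disk when needed.
sc = SparkContext(conf=SparkConf().setAppName("persist_example"))
sqlContext = HiveContext(sc)
dataframe1 = sqlContext.table("some_db.some_table")   # placeholder table name
dataframe2 = dataframe1.persist(StorageLevel.MEMORY_AND_DISK)
dataframe2.count()  # an action is needed before anything is actually cached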
11-12-2018
04:17 AM
1 Kudo
Hello community,
In Spark >= 2.0, the following statement should be correct:
df2_DF = df1_DF.checkpoint()
But of course, if I try to checkpoint a DataFrame in Spark 1.6, I get the following exception, as DataFrame checkpointing wasn't implemented yet:
AttributeError: 'DataFrame' object has no attribute 'checkpoint'
Is there a quick way to implement this in Spark 1.6.0 using a workaround? What I've tried so far (similar to what I saw in a post on Stack Overflow) is the following approach, using an intermediate RDD conversion of the original DataFrame df1:
...
sc.setCheckpointDir("hdfs:///tmp")
...
df1 = sqlContext.table("<TABLE_NAME>")
...
df1.rdd.checkpoint()
df1.rdd.count()
df2 = sqlContext.createDataFrame(df1.rdd, df1.schema)
But I'd like to ask:
- Is everybody else using this approach? Can you please give feedback if you do?
- Is there a different, better approach? What I'm doing in reality is looping over a DataFrame, and it gets "heavier" at each loop.
- If the above is the only option, does it make sense to directly reassign the "should-now-be-checkpointed" RDD to the original DataFrame in the last statement? Something like: df1 = sqlContext.createDataFrame(df1.rdd, df1.schema)
- Any other observations/comments?
Thanks in advance for any insight or help.
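For reference, here is the same workaround wrapped in a small helper function, which is what I'd call inside the loop (just a sketch; the function name is mine, not part of any API):

# Sketch of the Spark 1.6 workaround above as a reusable helper.
def checkpoint_df(df, sql_context):
    rdd = df.rdd              # grab the RDD once so we checkpoint and count the same object
    rdd.checkpoint()
    rdd.count()               # an action is required to actually materialize the checkpoint
    return sql_context.createDataFrame(rdd, df.schema)

# inside the loop that makes the DataFrame "heavier":
# df1 = checkpoint_df(df1, sqlContext)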
01-29-2018
07:26 AM
1) Apparently, yes
2) The name of the user you're trying to use to log in to the remote system, I suppose. Please note that the user you specify here is the user "oozie" will run as, so you might eventually run into other, unpredictable problems when using Oozie
3) I don't really know, sorry about that... Even though I'm pretty sure I've understood the cause of your issue, I've never had to deal with it directly myself. Maybe the easiest way would be to follow the additional suggestions I wrote in my previous answer (give the OS user "yarn" permission to "ssh" and/or "su"). Or, another possibility would be to create a "yarn" user on the remote system and grant that user the correct permissions to reach the final working directory.
I hope you'll manage to get through the problems and make it 🙂
01-29-2018
07:19 AM
Hi all, I have a File Action (part of an Oozie Workflow) that runs every 15 minutes and moves all the files in a "receiving" HDFS directory into an "in_process" HDFS directory.
Everything is OK unless, for whatever reason, the number of files in the "receiving" HDFS directory grows too much. If, let's say, that number gets to be > 30000 (not exactly, but around that number), the File Action fails without meaningful errors in the logs.
I'd need help sorting out a few possible options:
- Is it possible to manually specify the maximum number of files a File Action can handle? E.g. by giving it more resources through some parameter, or by specifying an explicit value somewhere?
- If I used a Shell Action instead, would it be a better choice? I'm a bit hesitant because, given that the "live" files are landing in the "receiving" HDFS directory at a very high rate, I'm afraid the Shell Action might not be able to keep up with the changes inside the directory... (a rough sketch of what I have in mind is below)
Looking forward to receiving your insights, thanks a lot!
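Here is roughly what I have in mind for the Shell Action script, written as a small Python sketch (paths are placeholders and it assumes the "hdfs" CLI is on the PATH of the node running the action): snapshot the file list first and move only that batch, so files that keep arriving are simply left for the next run.

import subprocess

# Sketch only: move a snapshot of the current files from "receiving" to "in_process" in chunks.
RECEIVING = "/data/receiving"      # placeholder
IN_PROCESS = "/data/in_process"    # placeholder
CHUNK = 1000                       # how many files to pass to a single "hdfs dfs -mv"

listing = subprocess.check_output(["hdfs", "dfs", "-ls", RECEIVING])
# keep only real entries (lines starting with a permission string), not the "Found N items" header
batch = [line.split()[-1] for line in listing.splitlines()
         if line.startswith("-") or line.startswith("d")]

for i in range(0, len(batch), CHUNK):
    subprocess.check_call(["hdfs", "dfs", "-mv"] + batch[i:i + CHUNK] + [IN_PROCESS])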
12-08-2017
07:45 AM
Hi, I had the same problem... After going through the logs, I found that the ojdbc driver could not be located. The simplest solution was to copy ojdbc.jar into the YARN home directory (/opt/cloudera/parcels/CDH-5.12.1-1.cdh5.12.1.p0.3/lib/hadoop-yarn) on all nodes.
11-23-2017
08:43 AM
Hi all, I'm building a log parser in Pig and I'm trying to use "pyasn", a Python extension that allows offline querying of an ASN database, to extract Autonomous System Number information from IP addresses.
The link to the project is here:
https://pypi.python.org/pypi/pyasn
What happens is that:
1) I successfully installed pyasn (first via pip install; currently I have built it manually, but it still doesn't work)
2) I wrote a custom UDF to be imported into Pig later, after wrapping it with Jython:
#!/usr/bin/python
import sys
sys.path.append('/usr/lib64/python2.6/site-packages/')
sys.path.append('/usr/lib64/python2.6/site-packages/pyasn-1.6.0b1-py2.6-linux-x86_64.egg/')
sys.path.append('/usr/lib/python2.6/site-packages/')
import pyasn

@outputSchema("asn:chararray")
def asnLookup(ip):
    asndb = pyasn.pyasn('asn.dat')
    asn = asndb.lookup(ip)
    return asn

@outputSchema("asn_prefix:chararray")
def asnGetAsPrefixes(nbr):
    asndb = pyasn.pyasn('asn.dat')
    asn_prefix = asndb.get_as_prefixes(nbr)
    return asn_prefix
3) But when I try to register my UDF, I get the following exception:
grunt> register 'hdfs:///user/xxxxxx/LIB/PYASN/python_pyasn.py' using jython as pythonPyasn;
2017-11-23 17:18:10,468 [main] INFO org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is deprecated. Instead, use fs.defaultFS
2017-11-23 17:18:10,939 [main] INFO org.apache.pig.scripting.jython.JythonScriptEngine - created tmp python.cachedir=/tmp/pig_jython_8271942503558994412
2017-11-23 17:18:12,468 [main] WARN org.apache.pig.scripting.jython.JythonScriptEngine - pig.cmd.args.remainders is empty. This is not expected unless on testing.
2017-11-23 17:18:13,236 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1121: Python Error. Traceback (most recent call last):
File "/tmp/pig6864734086775637011tmp/python_pyasn.py", line 8, in <module>
import pyasn
File "/usr/lib64/python2.6/site-packages/pyasn-1.6.0b1-py2.6-linux-x86_64.egg/pyasn/__init__.py", line 20
SyntaxError: future feature print_function is not defined
4) The puzzling thing is that I'm currently doing the exact same thing with another Python extension for geolocation (PyGeoIP), and it works smoothly: the concept is the same, I wrote a UDF, imported it into Pig wrapped with Jython, and I can call it successfully!
5) If, just to check things are formally OK, I open a PySpark shell and use the extension, it works without any problems. But I don't want to (and can't) use Spark in this case, for a number of reasons.
Any ideas/insight would be very much appreciated!
Thanks
09-26-2017
09:14 AM
Have you tried a configuration similar to the following?
- 3 executors (1 per DataNode)
- A very low initial memory setting for the executors (e.g. 1 GB)
- Limiting the vCPUs to 1 for the executors (at least initially)
- Maybe you can start with 2 vCPUs and 2 GB for the driver
- It's important to allow a generous overhead on top of the RAM usable by the executors, in case they run out of their initial resources (the 1 GB we specified before), but only when/if they need it. So set the following parameters accordingly:
spark.yarn.executor.memoryOverhead = 8 GB
spark.yarn.driver.memoryOverhead = 8 GB
- You can read the following docs to get a better grasp of the concepts behind resource allocation:
https://www.cloudera.com/documentation/enterprise/5-11-x/topics/cdh_ig_yarn_tuning.html
https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_ig_running_spark_on_yarn.html
The first doc also includes a link to a "YARN Tuning Spreadsheet". The docs contain details on how to configure all the features mentioned above 🙂 A sketch of these settings expressed in code follows below.
HTH
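Something along these lines (just a sketch of the starting values above; the overhead properties take values in MB, and in practice the driver settings are usually passed on the spark-submit command line rather than inside the script):

from pyspark import SparkConf, SparkContext

# Sketch only: suggested starting values expressed as Spark properties (8192 MB = 8 GB).
conf = (SparkConf()
        .setAppName("tuning_starting_point")
        .set("spark.executor.instances", "3")
        .set("spark.executor.cores", "1")
        .set("spark.executor.memory", "1g")
        .set("spark.driver.cores", "2")
        .set("spark.driver.memory", "2g")
        .set("spark.yarn.executor.memoryOverhead", "8192")
        .set("spark.yarn.driver.memoryOverhead", "8192"))
sc = SparkContext(conf=conf)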
07-22-2017
08:45 AM
Thanks mbigelow, following your suggestions I solved the massive error-logging issue. I ran the specific log file referenced in the Java stack trace through a JSON validator:
/user/spark/applicationHistory/application_1494352758818_0117_1
The format was correct according to the validator, so I just moved the file into a temporary directory. As soon as I did, the error messages stopped clogging the system logs. So it was probably corrupted in a very subtle way... but it was definitely corrupted. That JSON file had indeed been generated by the Spark Action that is giving me problems, but it was an OLD file. New instances of that Spark Action are generating new JSON logs, and they are not giving the History Server any trouble (the flood of logged exceptions has stopped, as I just said). Unfortunately, the Spark job itself is still failing and needs further investigation on my side, so apparently the failure is not related to that specific error message. But I've solved an annoying problem, and at the same time I have ruled out the possibility that the Spark Action issue is related to that Java exception. Thanks!
06-27-2017
06:42 AM
In the end I've been able to solve the issue. I was tricked by the fact that re-applying the "YARN Resources Allocation Tuning Guide" from scratch proposes (in my opinion) a misleading way of calculating a few important parameters. The guide can be found here: https://www.cloudera.com/documentation/enterprise/5-10-x/topics/cdh_ig_yarn_tuning.html
As a matter of fact, the guide contains a downloadable XLS file which is a tool for calculating optimal parameters. This XLS automatically calculates and proposes a few values to be assigned to the YARN configuration. In the spreadsheet, step 4 proposed "2" for "yarn.nodemanager.resource.cpu-vcores" and "5632" for "yarn.nodemanager.resource.memory-mb". I later found out that the correct values to assign to those configurations are the two values proposed at step 5.
Definitely, partly my fault (I do not have deep knowledge of YARN configuration), but a partly misleading doc indeed. I am now fine-tuning, trying different settings for the various Java heap sizes etc. Still, I have no idea why everything was working fine until recently and stopped working after upgrading to 5.11, as I did not change any configuration while upgrading and the physical resources are identical.
02-06-2017
10:58 AM
Hi, thanks for your reply. I have tried to create the table first, and indeed the overall behaviour changed. Now I get a different exception, which I'm pasting just below. The strange thing is that the class referenced in the exception, "ClientBackoffPolicyFactory", IS loaded and present (it's in "--jars", as detailed in earlier posts above this one). Here is the main excerpt from the error stack:
2017-02-06 19:31:51,426 WARN [Thread-8] ipc.RpcControllerFactory (RpcControllerFactory.java:instantiate(78)) - Cannot load configured "hbase.rpc.controllerfactory.class" (org.apache.hadoop.hbase.ipc.controller.ServerRpcControllerFactory) from hbase-site.xml, falling back to use default RpcControllerFactory
2017-02-06 19:31:51,431 ERROR [Thread-8] datasources.InsertIntoHadoopFsRelation (Logging.scala:logError(95)) - Aborting job.
java.io.IOException: java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.hbase.client.backoff.ClientBackoffPolicyFactory$NoBackoffPolicy
...
Caused by: java.io.IOException: java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.reflect.InvocationTargetException
...
Caused by: java.lang.UnsupportedOperationException: Unable to find org.apache.hadoop.hbase.client.backoff.ClientBackoffPolicyFactory$NoBackoffPolicy
...
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.hbase.client.backoff.ClientBackoffPolicyFactory$NoBackoffPolicy
...
2017-02-06 19:31:51,538 ERROR [Driver] yarn.ApplicationMaster (Logging.scala:logError(74)) - User application exited with status 1
Given that "hbase-site.xml" is mentioned in the error stack, I'm also pasting that file just below:
<?xml version="1.0" encoding="UTF-8"?>
<!--Autogenerated by Cloudera Manager-->
<configuration>
<property>
<name>hbase.rootdir</name>
<value>hdfs://xxx01.yyy.it:8020/hbase</value>
</property>
<property>
<name>hbase.replication</name>
<value>true</value>
</property>
<property>
<name>hbase.client.write.buffer</name>
<value>2097152</value>
</property>
<property>
<name>hbase.client.pause</name>
<value>100</value>
</property>
<property>
<name>hbase.client.retries.number</name>
<value>35</value>
</property>
<property>
<name>hbase.client.scanner.caching</name>
<value>100</value>
</property>
<property>
<name>hbase.client.keyvalue.maxsize</name>
<value>10485760</value>
</property>
<property>
<name>hbase.ipc.client.allowsInterrupt</name>
<value>true</value>
</property>
<property>
<name>hbase.client.primaryCallTimeout.get</name>
<value>10</value>
</property>
<property>
<name>hbase.client.primaryCallTimeout.multiget</name>
<value>10</value>
</property>
<property>
<name>hbase.regionserver.thrift.http</name>
<value>false</value>
</property>
<property>
<name>hbase.thrift.support.proxyuser</name>
<value>false</value>
</property>
<property>
<name>hbase.rpc.timeout</name>
<value>60000</value>
</property>
<property>
<name>hbase.snapshot.enabled</name>
<value>true</value>
</property>
<property>
<name>hbase.snapshot.master.timeoutMillis</name>
<value>60000</value>
</property>
<property>
<name>hbase.snapshot.region.timeout</name>
<value>60000</value>
</property>
<property>
<name>hbase.snapshot.master.timeout.millis</name>
<value>60000</value>
</property>
<property>
<name>hbase.security.authentication</name>
<value>simple</value>
</property>
<property>
<name>hbase.rpc.protection</name>
<value>authentication</value>
</property>
<property>
<name>zookeeper.session.timeout</name>
<value>60000</value>
</property>
<property>
<name>zookeeper.znode.parent</name>
<value>/hbase</value>
</property>
<property>
<name>zookeeper.znode.rootserver</name>
<value>root-region-server</value>
</property>
<property>
<name>hbase.zookeeper.quorum</name>
<value>xxx02.yyy.it,xxx01.yyy.it,xxx03.yyy.it</value>
</property>
<property>
<name>hbase.zookeeper.property.clientPort</name>
<value>2181</value>
</property>
<property>
<name>hbase.rest.ssl.enabled</name>
<value>false</value>
</property>
</configuration>
Thanks for any help... Meanwhile, I'll go on testing things, but maybe I'll try to find a workaround and do something completely different. This is starting to be a bit too much for my skills/patience 🙂
01-17-2017
05:20 PM
1 Kudo
A schema or protocol may not contain multiple definitions of a fullname. Further, a name must be defined before it is used ("before" in the depth-first, left-to-right traversal of the JSON parse tree, where the types attribute of a protocol is always deemed to come "before" the messages attribute.)
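As a quick illustration of the rule, here is a sketch using the Python avro package (the package and the exact parse call are assumptions on my side): the record "Address" is defined once, on first use, and only referenced by its name afterwards, rather than being defined a second time.

import avro.schema

# Sketch: "com.example.Address" is defined before it is used; a second full
# definition of the same fullname would be rejected.
schema_json = """
{
  "type": "record",
  "name": "Person",
  "namespace": "com.example",
  "fields": [
    {"name": "home", "type": {"type": "record", "name": "Address",
                              "fields": [{"name": "street", "type": "string"}]}},
    {"name": "work", "type": "Address"}
  ]
}
"""
schema = avro.schema.parse(schema_json)
print(schema)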
10-17-2016
02:18 PM
Ah, my error was not using an "hdfs:" path for the .py file. Thanks!
09-12-2016
09:25 AM
Thanks for the clarification. I thought that, as this widget was not part of the "new features" that you can manually activate in Hue 3.9, this would work... My bad
08-15-2016
05:15 AM
Thank you. Useful insight and crystal-clear argumentation, as usual from you. In the meanwhile I've had the chance to study a bit more, and in the end I came to a conclusion that matches your considerations, so I'm glad that I apparently moved in the right direction. As a matter of fact, I've looked at this open source project, http://opentsdb.net , and generally speaking the approach they use is the last one you explained. To provide a practical example, in my case:
- A new record every week for the same Customer entity
- Therefore, column versioning is NOT used at all (like you suggested)
- A "speaking" record key, e.g. "<CUST_ID> + <YYYY-MM-DD>"
- This sort of key is not monotonically increasing, because the "CUST_ID" part is "stable", so this approach should also be good from a table-splitting perspective (when the table grows, it will split up "evenly" and all the splits will take care of a part of the future inserts, balancing the machine load evenly)
- The same set of columns for each record, containing the newly sampled value of each field for that week, e.g. "<Total progressive time used Service X>"
This is the approach I used in the end (a small sketch is below). It has nothing to do with my original idea of using versions, but it perfectly matches the last approach you described in your answer. Regarding the fixed values (e.g. "Name", "Surname"), I've decided to replicate them every week too, as if they were time series themselves... I know, a waste of storage. I'm planning to modify this structure soon, move the fixed values into another table (Hive or HBase, I don't know yet), and pick up the information when I need it (for instance, during data processing I'll join the relevant anagraphic data into the relevant DataFrames). I just wanted to write a few more lines about the issue for posterity. I hope this post will be useful to people 🙂 Thanks again!
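A small sketch of the key/column layout described above (happybase is just a convenient client I'm using for illustration, and the host, table, and column names are placeholders, not what I actually use):

import datetime
import happybase

# Sketch only: one new row per customer per week, "speaking" row key, no versioning.
def weekly_row_key(cust_id, day=None):
    day = day or datetime.date.today()
    return "{0}+{1}".format(cust_id, day.strftime("%Y-%m-%d"))

connection = happybase.Connection("<HBASE_THRIFT_HOST>")
table = connection.table("customer_weekly")
table.put(weekly_row_key("C000123"),
          {"usage:total_time_service_x": "3600"})   # new sampled value for this week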
07-28-2016
03:11 AM
Thanks. It seems like a good alternative, and as a matter of fact I was not aware of its availability in CDH 5.7. I'm marking the thread as solved, even though for now I don't know yet whether all the features I'd need will be there in the native hbase-spark connector.
07-16-2016
02:46 PM
I ended up changing the permissions to 777 on both the "source" and "destination" directories on HDFS. Of course, I have a limited understanding of all the security implications going on behind the scenes, but this seems to me like a bug and not a feature. If I log in to Hue as a particular user, and this user has been granted the correct read/write permissions on the relevant directories, I don't see why I should be obliged to change the permissions to 777 for the container to be able to do the job. Therefore I'm not marking this post as "resolved", even though I did work around the issue somehow (in a very bad way, actually).
05-19-2016
11:39 AM
Can you please share your "workflow.xml" and "job.properties" files? Also, can you try to adapt your "spark_easy.py" as follows and give it a try?
from pyspark import SparkConf, SparkContext
from pyspark.sql import HiveContext
from pyspark.sql.functions import *
sconf = SparkConf().setAppName("SparkEasy").set("spark.driver.memory", "1g")
sc = SparkContext(conf=sconf)
sqlCtx = HiveContext(sc)
simple_DF = sqlCtx.sql("select * from <WHATEVER_EXISTING_TABLE_HERE>")
HTH
03-06-2016
11:13 AM
Thanks for the suggestion about registering the class, and for the additional info. I have to say that I had already read the link you sent me, but I didn't really get what "you have to register the classes first" meant. In fact, I gave up on my attempts at the time and stuck with the Java serializer for my testing purposes. Maybe I'll get back to this in the future, and I'll do it with this additional knowledge now (for the record, a sketch of what the registration looks like is below). Thanks a lot. Also, I have to say that, after reading all this, I now find it a bit strange that Cloudera sets Kryo as the default serializer. Anyway.
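A sketch of the registration, as I understand it (the class names below are purely illustrative examples, and registration concerns the JVM-side classes being serialized):

from pyspark import SparkConf, SparkContext

# Sketch only: enable Kryo and register classes up front; the class names are hypothetical.
conf = (SparkConf()
        .setAppName("kryo_registration_example")
        .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
        .set("spark.kryo.classesToRegister",
             "org.example.MyClass,org.example.MyOtherClass"))
sc = SparkContext(conf=conf)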
01-21-2016
01:43 AM
Maybe it has something to do with the approach/configuration explained at the link here below? Just an idea... http://www.cloudera.com/documentation/enterprise/latest/topics/cm_sg_yarn_long_jobs.html