Member since
11-24-2015
223
Posts
10
Kudos Received
0
Solutions
04-15-2021
05:56 PM
Hi, we are trying to backup a kudu table as below : spark2-submit --principal <user> --keytab <keytab> --master yarn --deploy-mode cluster --queue <queue> --executor-memory 12G --executor-cores 4 --driver-memory 4G --driver-cores 1 --class org.apache.kudu.backup.KuduBackup kudu-backup2_2.11-1.13.0.7.1.5.0-257.jar --kuduMasterAddresses $KUDU_MASTERS --rootPath hdfs:///backups --forceFull true impala::<table> And it is super slow. Any suggestions on how to make it run faster? Appreciate the feedback.
... View more
Labels:
- Labels:
-
Apache Kudu
04-06-2021
10:49 AM
BTW files with ".tmp" extension could be under any subdirectory under "/backups". Thanks.
... View more
04-06-2021
10:45 AM
Hi, In BDR HDFS replication I want to exclude all files which end with ".tmp" in the directory "/backups/"
Appreciate it if somebody could give the expression for this to add in BDR "Add Exclusion".
Thanks
... View more
Labels:
- Labels:
-
Cloudera Manager
-
HDFS
04-16-2020
05:50 AM
We are trying to implement alerting in our cluster and alerting is setup in Cloudera Manager. So when I stop a service in Cloudera Manager, an alert is sent to my email. Because for some reason I hear that if you stop the service from CM, it is not the same as it crashing on its own. Especially with regards Canary Alerts, which we will not get if we stop a service through Cloudera Manager. So will I not get Canary Alerts for a service if the service is stopped through Cloudera Manager? Also I would like to know how to stop a service manually through Cloudera API. I would appreciate it if some forum member could give the command to stop - say Oozie or HBase - through Cloudera Manager API. Appreciate the help.
... View more
Labels:
- Labels:
-
Cloudera Manager
03-11-2019
02:02 PM
@Kuldeep Kulkarni, there are many lines with "input data" in the page you referred - not sure which ones to ignore. Should I ignore the sections for datasets/input events/output events - that will leave only the workflow section. Is that right? Can't I use the coordinator from your shell action example? But in that I don't see : "<app-path>${workflowAppUri}</app-path>" Appreciate the clarification.
... View more
03-08-2019
07:07 PM
@Kuldeep Kulkarni , does your example : https://community.hortonworks.com/articles/27497/oozie-coordinator-and-based-on-input-data-events.html set the job to run once a day? If not, can you please let me know how to do that? I want to run a job once daily. Thanks.
... View more
03-08-2019
06:58 PM
Hi Kuldeep, thanks so much for the clarification. I will try to do as per your instructions and let you know how it went. Thanks again.
... View more
03-08-2019
02:34 PM
@Kuldeep Kulkarni, I created python action based on https://community.hortonworks.com/content/supportkb/151119/how-to-run-a-python-script-using-oozie-shell-actio.html But how do I integrate coordinator.xml with that? I tried creating the file but it is not executing as per that. Is there somewhere in job.properties or workflow.xml that you mention coordinator.xml? Appreciate the feedback.
... View more
03-06-2019
07:21 PM
yes, I checked the nodes and found the output in one of them. I reran to make sure. So the script is not really needed on the Linux box? all of them - job.properties, the shellscript, workflow.xml, coordinator.xml? They need to be only in hdfs? Also next, how to execute a python code from oozie? Also I want it to run daily. Appreciate the insights.
... View more