Member since: 07-31-2013
Posts: 1924
Kudos Received: 462
Solutions: 311
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1969 | 07-09-2019 12:53 AM |
| | 11881 | 06-23-2019 08:37 PM |
| | 9147 | 06-18-2019 11:28 PM |
| | 10134 | 05-23-2019 08:46 PM |
| | 4580 | 05-20-2019 01:14 AM |
11-14-2017
12:40 AM
Raising the heap should mostly help with this, as @saranvisa points out. GC is usually driven by inadequate free heap memory, and allocating more heap helps with that. However, if the GCs are occurring without any change or increase in load, they could also be related to the history retention configurations of the JobTracker.

> Cm -> Yarn -> Configuration -> "Garbage Collection Duration Monitoring Period" Increase it from 5 mins to 10 mins

I'd not recommend doing this, as it only hides the problem. You should be concerned if there's excessive GC due to heap pressure, and the default alert window and thresholds cover it well. Only increase it if you're absolutely sure the GC spikes are temporary and workload-driven but non-concerning, which doesn't seem to have been determined yet.

Also, if you want to check the heap usage before increasing the heap by any value, you can see it in the heap charts shown on the JobTracker instance page of CM.
11-12-2017
05:33 PM
Yes, that is right. In MySQL though, you can ease user access provisioning by granting wildcard host login access, which allows logins from all hosts: https://dev.mysql.com/doc/refman/5.7/en/adding-users.html (look for the % character example on that page).
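As a rough illustration of a wildcard host grant, the sketch below only builds the two MySQL statements as strings. The account name, database name, and password are hypothetical placeholders, and the helper does no escaping, so treat it as an illustration and not as something to feed untrusted input.

```python
def wildcard_grant_statements(user, database, password):
    """Build MySQL statements that let `user` log in from any host.

    The '%' host part is the wildcard that matches every client host.
    All three arguments are placeholders chosen by the caller.
    """
    account = "'{}'@'%'".format(user)
    return [
        "CREATE USER {} IDENTIFIED BY '{}';".format(account, password),
        "GRANT ALL PRIVILEGES ON {}.* TO {};".format(database, account),
    ]


# Hypothetical names, for illustration only.
for statement in wildcard_grant_statements("appuser", "appdb", "change-me"):
    print(statement)
```

Granting to '%' trades tighter host restrictions for easier provisioning, so it is worth pairing with a strong password and network-level controls.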
11-08-2017
03:43 AM
1 Kudo
You can test cross-realm trust by using MIT Kerberos' 'kvno' command. Assuming realms A and B:

1. kinit as any identity from realm A
2. Run: kvno hdfs/namenode-host@B

If kvno grabs a service ticket, everything is fine with the trust between B and A. Repeat the test in the inverse fashion, with A's namenode-host, to check the other direction.

P.S. Ensure that the encryption types of all the krbtgt principals are the same on both KDCs (verify with getprinc), and that both clusters pass the Hosts -> Security Inspector check in CM.
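The two-step check above could be scripted roughly as follows. This is a minimal sketch: the function names and the example principal are hypothetical, and it assumes kinit and kvno are on the PATH and that kvno exits with a non-zero status when it cannot obtain the ticket.

```python
import subprocess


def trust_check_commands(user_principal, service_host, target_realm):
    """Return the kinit and kvno command lines for one direction of the test."""
    service_principal = "hdfs/{}@{}".format(service_host, target_realm)
    return [
        ["kinit", user_principal],
        ["kvno", service_principal],
    ]


def run_trust_check(user_principal, service_host, target_realm):
    """Run both commands; True if kvno obtained a service ticket.

    Assumption: a zero exit status from kvno means the ticket was granted.
    """
    kinit_cmd, kvno_cmd = trust_check_commands(
        user_principal, service_host, target_realm
    )
    subprocess.run(kinit_cmd, check=True)  # prompts for the password
    return subprocess.run(kvno_cmd).returncode == 0


# Example (hypothetical identity from realm A, NameNode in realm B):
# run_trust_check("alice@A", "namenode-host", "B")
```

Swapping the realms and using A's NameNode host covers the opposite direction of the trust.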
11-08-2017
02:32 AM
For your first scenario's error, you appear to be attempting to use an unsecure client config to talk to a remote secure cluster. Perhaps you're using a gateway host that does not have updated client configs? Ideally, the error should mirror the second scenario's error type.

For your second scenario's error, the two realms do not appear to have a cross-realm trust set up. If you're using MIT Kerberos, follow this for your two realms: https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/5/html/Deployment_Guide/sec-kerberos-crossrealm.html
11-08-2017
12:59 AM
Is there any pattern to this? For example, do the few tasks that hang all run on the same host, or on a specific set of hosts among all nodes in the cluster?

A more detailed root cause can be sought by running jstack against a task while it is hung. To do this, first find which host the hung task is running on (within the task timeout period, after noticing it hanging), discover its container ID, find the associated java process on that machine, and then run the jstack command on its PID.
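The "find the java process for the container" step could be sketched as below. This assumes the container ID appears somewhere in the task JVM's command line and that the process list comes from `ps -eo pid,args`; the sample output, paths, and function names are made up for illustration.

```python
def find_container_java_pids(ps_output, container_id):
    """Return PIDs of java processes whose command line mentions container_id.

    ps_output is text in the format produced by `ps -eo pid,args`.
    """
    pids = []
    for line in ps_output.splitlines():
        parts = line.split(None, 2)  # pid, executable, remaining arguments
        if len(parts) < 2 or not parts[0].isdigit():
            continue  # skips the header row and blank lines
        executable = parts[1]
        # Ignore shell wrappers; only keep the JVM itself.
        if executable.endswith("java") and container_id in line:
            pids.append(int(parts[0]))
    return pids


def jstack_command(pid):
    """Command line that dumps the thread stacks of the given JVM."""
    return ["jstack", str(pid)]


# Made-up sample of `ps -eo pid,args` output on a worker host.
SAMPLE_PS_OUTPUT = """  PID COMMAND
 4021 /bin/bash -c /usr/java/default/bin/java -Dlog.dir=/var/log/container_1510000000000_0001_01_000002 YarnChild
 4033 /usr/java/default/bin/java -Dlog.dir=/var/log/container_1510000000000_0001_01_000002 YarnChild
 4100 /usr/java/default/bin/java -Dlog.dir=/var/log/container_1510000000000_0001_01_000003 YarnChild
"""

pids = find_container_java_pids(
    SAMPLE_PS_OUTPUT, "container_1510000000000_0001_01_000002"
)
print(pids)  # → [4033]
```

jstack generally needs to run as the same OS user that owns the target JVM, so the resulting command is usually run with sudo -u under the task's user.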
11-08-2017
12:07 AM
It seems like your VM either has too little RAM or is unable to get adequate CPU cycles to run the RS continually, per the below snippet:

2017-06-06 22:21:59,870 WARN [regionserver/quickstart.cloudera/127.0.0.1:60020] util.Sleeper: We slept 81086ms instead of 3000ms, this is likely due to a long garbage collecting pause and it's usually bad, see http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired

Can you try increasing the RAM available to the VM?
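To gauge how severe such pauses are across a whole RegionServer log, the Sleeper warnings can be parsed for the actual and expected sleep times. The sketch below is one possible approach; the function name is hypothetical and the regular expression is based only on the message format shown in the snippet above.

```python
import re

# Matches the "We slept Xms instead of Yms" part of the Sleeper warning.
SLEEPER_PATTERN = re.compile(r"We slept (\d+)ms instead of (\d+)ms")


def parse_sleeper_pause(log_line):
    """Return (slept_ms, expected_ms, overshoot_ms), or None if no match."""
    match = SLEEPER_PATTERN.search(log_line)
    if match is None:
        return None
    slept = int(match.group(1))
    expected = int(match.group(2))
    return slept, expected, slept - expected


line = "util.Sleeper: We slept 81086ms instead of 3000ms"
print(parse_sleeper_pause(line))  # → (81086, 3000, 78086)
```

Here the overshoot is about 78 seconds, long enough to risk the ZooKeeper session expiry described at the linked HBase book section.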
11-07-2017
11:47 PM
1 Kudo
The standalone Spark 2.x is designed to co-exist with the CDH-included Spark 1.6, and as such all the commands differ. The command difference list is available at https://www.cloudera.com/documentation/spark2/latest/topics/spark_running_apps.html#spark2_commands
11-07-2017
11:31 PM
1 Kudo
The command 'pyspark' is for Spark 1.6 so it certainly needs a Spark Gateway to function. If you want to use PySpark with Spark 2, the command is 'pyspark2' instead.
11-07-2017
11:02 PM
If you're managing your Hue service via Cloudera Manager, you can do the dump and load via the UI. Stop the Hue service with the SQLite configuration, then click 'Dump Database' under the Hue service page's Actions button. Next, reconfigure the stopped Hue service to use your new MySQL DB, and before starting it go back to the Actions button and click 'Load Database'.
11-07-2017
10:57 PM
To find what's included in a CDH release, visit the 'CDH Version and Packaging Information' area of the documentation: https://www.cloudera.com/documentation/enterprise/release-notes/topics/rg_cdh_vd.html

Specifically, to find what's in the 5.13.x and 5.12.x releases, visit the following links:

- https://www.cloudera.com/documentation/enterprise/release-notes/topics/cm_vd_cdh_package_tarball_513.html
- https://www.cloudera.com/documentation/enterprise/release-notes/topics/cm_vd_cdh_package_tarball_512.html

Kafka is currently not a part of the base CDH packaging, and is available as a separate parcel. Follow the Kafka documentation page for instructions on how to add it to your cluster: https://www.cloudera.com/documentation/kafka/latest/topics/kafka.html

Sqoop and Flume have been included in CDH5 since its inception, and Kudu has been included in CDH5 since 5.13.x (previously it was a separate parcel).