Member since
01-19-2017
3681
Posts
633
Kudos Received
372
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 1626 | 06-04-2025 11:36 PM | |
| 2086 | 03-23-2025 05:23 AM | |
| 988 | 03-17-2025 10:18 AM | |
| 3765 | 03-05-2025 01:34 PM | |
| 2591 | 03-03-2025 01:09 PM |
03-25-2020
10:38 AM
Below are the steps to troubleshoot distcp:- 1. it is not problem with the hdfs or Kerberos or distcp but a MapReduce. 2. We tried to run a sample MR job to test, then it failed with the following exception Error: Java.io.IOException: initialization of all the collectors failed. Error in last collector was:java.io.IOException: Invalid “mapreduce.task.io.sort.mb”:3276. (The total amount of buffer memory to use while sorting files, in MB). It was expecting less than 2048. Changing this property able to run the distcp smooth. I want to take a moment and say thanks to Shelton for responding it on time.
... View more
03-05-2020
11:41 PM
@SushantRao where is the source code for Cloudera Manager and all the other CDH7 components ? Is there documentation available to locate and build the packages ?
... View more
02-19-2020
04:44 PM
1 Kudo
@WilsonLozano,
As this thread is older and was marked 'Solved back in August of 2019 you would have a better chance of receiving a resolution by starting a new thread. This will also provide the opportunity to provide details specific to your environment, version of CDH, etc. that could aid others in providing a more accurate answer to your question.
... View more
02-03-2020
01:28 PM
1 Kudo
Dear Jay what to say excellent answer , you are really one of the best here
... View more
01-31-2020
05:58 AM
1 Kudo
@Manoj690 It's always a good idea to share the HDP and Zk version plus the zk logs in /var/log/* having said that can you share your zoo.cfg ? If you really need enable all four letter word commands by default, you can use the asterisk option so you don't have to include every command one by one in the list.See below 4lw.commands.whitelist=* As you have not shared your logs that's a starting point, then restart your zookeeper and let me know!
... View more
01-30-2020
04:14 AM
@Shelton we are still experiencing slowness in ambari UI after enabling the above properties.
... View more
01-24-2020
06:27 AM
Hi @Shelton Finally I've solved the problem and hope my experience could help someone else who is banging his/her head on this scenario. At time of writing I was accessing the GCP cluster via public IP addresses. I had a /etc/hosts file resolving hostnames to their public IP Then customer's NOC granted me access from internal network over VPN and next attempt went fine. Still can't understand why, maybe some networking related config issue, but being on the internal side of the network solved my problem Simple as that! Thanks for your precious support, and patience Stefano
... View more
01-15-2020
10:02 AM
@Shelton @EricL Thank you both. the correct ACL spec is group::r-x Now the following command works. sudo -u zeppelin hadoop fs -ls /warehouse/tablespace/managed/hive/test1 From what I just ran into, I feel that, by design, Hive takes extra effort to prevent users from accessing managed table files directly. I will follow that design and access Hive managed table only through Hive.
... View more
01-13-2020
03:37 PM
Sorry for the bump. I tried this in the HDP Sandbox and discovered that installing krb5-workstation-1.15.1-37.el7_7.2.x86_64.rpm solved the problem
... View more
01-12-2020
12:56 PM
1 Kudo
@mike_bronson7 When your cluster is in HA it uses a namespace that acts as a load balancer to facilitate the switch from active to standby and vice versa. The dfs-site-xml holds these values filter using dfs.nameservices the nameservice-id should be your namespace or in HA look for dfs.ha.namenodes.[nameservice ID] dfs.ha.namenodes.[nameservice ID] e.g dfs.ha.namenodes.mycluster And that's the value to set eg hdfs://mycluster_namespace/user/ams/hbase The refresh the stale configs , now HBase should sending the metrics to that directory HTH
... View more