Member since
07-30-2020
219
Posts
46
Kudos Received
60
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 4887 | 11-20-2024 11:11 PM |
| | 2926 | 09-26-2024 05:30 AM |
| | 2459 | 10-26-2023 08:08 AM |
| | 4193 | 09-13-2023 06:56 AM |
| | 4503 | 08-25-2023 06:04 AM |
10-28-2022
01:33 AM
2 Kudos
Hi @mike_bronson7 , The rule of thumb is to allocate about 1 GB of NameNode heap per 1 million blocks. Since you have already doubled the heap, the next step is to look at tuning the garbage collector: https://community.cloudera.com/t5/Community-Articles/NameNode-Garbage-Collection-Configuration-Best-Practices-and/ta-p/245276 You can try tuning the number of GC threads to see if that helps. That being said, these pauses are not large enough to cause any real trouble, they are not repeating very frequently (within minutes), and occasional pauses can be expected on a busy cluster. You can also consider raising the threshold for the JVM pause alert to, say, 5-10 seconds, so that only pauses that need intervention are reported.
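As a sketch, the GC thread count and collector flags are set via the NameNode's JVM options in the hadoop-env template (Ambari: HDFS > Configs > Advanced hadoop-env). The heap size and flag values below are illustrative examples only, not recommendations for any particular cluster:

```shell
# Illustrative NameNode GC settings for a Java 8 / CMS setup.
# -Xms/-Xmx and all thread counts are example values -- size them for your block count.
export HADOOP_NAMENODE_OPTS="-Xms16g -Xmx16g \
  -XX:+UseConcMarkSweepGC \
  -XX:ParallelGCThreads=8 \
  -XX:+CMSParallelRemarkEnabled \
  -XX:CMSInitiatingOccupancyFraction=70 \
  -XX:+UseCMSInitiatingOccupancyOnly \
  ${HADOOP_NAMENODE_OPTS}"
```

After a restart, the effective flags can be confirmed with `jps -v` on the NameNode host.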
10-28-2022
01:23 AM
Hi @TheFixer , During reads, if region locality is poor, HBase has to fetch blocks from other slave nodes. Because the blocks are not local to the RegionServer, they must be read over the network, which adds latency even when the disks themselves are fast enough. You may want to run a major compaction on that table to see if read performance improves, since compaction rewrites the HFiles locally and restores locality. Further, when reads are performed for the first time on a recently written table, the blocks are not yet cached in memory; subsequent reads should perform comparatively better because the blocks will be cached in the BlockCache.
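As a sketch of the compaction step above (the table name `my_table` is a placeholder, and these commands assume an HBase client/gateway node):

```shell
# Trigger a major compaction to rewrite HFiles locally and restore region locality.
echo "major_compact 'my_table'" | hbase shell -n

# Compaction runs asynchronously; watch its progress and the per-RegionServer
# data-locality metric in the HBase Master web UI before re-testing reads.
```

Major compaction is I/O-heavy, so it is usually best scheduled during a low-traffic window.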
10-21-2022
04:14 AM
Hi @hanumanth , Network bandwidth needs to be limited on the OS side using tools such as traffic control (tc), which can be applied to the NIC that carries the IP address used by the CDH roles. More information can be found in the Red Hat docs: https://access.redhat.com/solutions/69133 https://access.redhat.com/solutions/1324033
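A minimal sketch using tc's token bucket filter, assuming the CDH roles' NIC is `eth0` and an illustrative 500 Mbit/s cap (both are placeholders; this requires root):

```shell
# Cap egress bandwidth on the interface used by the CDH roles.
tc qdisc add dev eth0 root tbf rate 500mbit burst 256kbit latency 400ms

# Verify the queueing discipline that is now in effect:
tc qdisc show dev eth0

# Remove the limit again when done testing:
tc qdisc del dev eth0 root
```

Note that tc shapes outgoing traffic only; limiting ingress needs policing or an ifb device, as described in the Red Hat articles above.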
10-18-2022
09:41 AM
Hi @fengsh Yes, the remove script indeed has a few bugs in its curl usage, which was picked up from AMBARI-18435, but this curl call is not officially supported and as such we don't have an official doc for it. The other option is to remove the older stack with the "yum remove <version>" command, specifying the exact version you want to remove. Before doing so, you can verify the list of packages that would be removed (please be thorough!): # yum list installed | grep -P '<version>' Additionally, please verify that the symlinks in the "/usr/hdp/current/" folder point to the correct locations. -- Was your question answered? Please take some time to click on “Accept as Solution” below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
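The check-then-remove flow might look like the following sketch, where `2.6.5.0-292` stands in for whatever old HDP version you are retiring (a hypothetical example; substitute your own version string and review every transaction yum proposes):

```shell
# 1. Review exactly which installed packages match the old stack version:
yum list installed | grep -P '2\.6\.5\.0-292'

# 2. Remove them (HDP package names encode the version with underscores);
#    yum shows the full removal list and asks for confirmation first:
yum remove '*2_6_5_0_292*'

# 3. Confirm the active-stack symlinks still point at the version you kept:
ls -l /usr/hdp/current/
```

Running the grep and the symlink check both before and after the removal makes it easy to spot anything that was removed unexpectedly.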
10-11-2022
02:12 AM
1 Kudo
Hi @fengsh , You can check the previously solved posts below to see if they help. https://community.cloudera.com/t5/Support-Questions/How-to-remove-an-old-HDP-version/m-p/116161 https://community.cloudera.com/t5/Support-Questions/Is-there-any-risk-to-delete-old-HDP-directories/m-p/96183 https://community.cloudera.com/t5/Community-Articles/Remove-Old-Stack-Versions-script-doesnt-work-in-ambari-2-7/ta-p/249303
10-05-2022
05:04 AM
1 Kudo
Hi, Those parameters are not exposed by Ambari and are false by default. If set, they would go into Custom spark-defaults. As they are disabled by default, I would suggest not enabling them.
09-28-2022
01:57 AM
Hi, Inside Spark, you can check spark.history.ui.acls.enable and spark.acls.enable. Both should be false by default. https://spark.apache.org/docs/2.4.3/security.html#authentication-and-authorization
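If you want these defaults recorded explicitly (purely for auditing; the values shown are already Spark's defaults), a spark-defaults.conf fragment would look like:

```properties
# spark-defaults.conf -- ACL toggles, shown at their default values
spark.acls.enable              false
spark.history.ui.acls.enable   false
```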
09-19-2022
02:10 AM
Hi @Anlarin , It is always recommended to have homogeneous disk storage across DataNodes. Within a DataNode, if the volumes are heterogeneous and block replicas are written to the disks in a round-robin fashion, the smaller disks fill up faster than the larger ones. Also, if the client is local to Node 2, the first replica of each block is placed on that node, so it is expected to fill faster. With the "Available Space" policy, the DataNode takes into account how much free space each volume has when deciding where to place a new replica. To distribute writes evenly as a percentage of capacity across the drives, change the volume choosing policy (dfs.datanode.fsdataset.volume.choosing.policy) to Available Space. If using Cloudera Manager:
1. Navigate to HDFS > Configuration > DataNode
2. Change "DataNode Volume Choosing Policy" from Round Robin to Available Space
3. Click Save Changes
4. Restart the DataNodes
Note that this property only balances the volumes within a single DataNode; it does not balance data across DataNodes. https://docs.cloudera.com/documentation/enterprise/latest/topics/admin_dn_storage_balancing.html - Was your question answered? Please take some time to click on “Accept as Solution” below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
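Outside Cloudera Manager, the same setting is an hdfs-site.xml property on the DataNodes (a configuration sketch; restart the DataNodes after applying):

```xml
<!-- hdfs-site.xml on each DataNode (or the DataNode safety valve in Cloudera Manager) -->
<property>
  <name>dfs.datanode.fsdataset.volume.choosing.policy</name>
  <value>org.apache.hadoop.hdfs.server.datanode.fsdataset.AvailableSpaceVolumeChoosingPolicy</value>
</property>
```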
09-15-2022
12:22 AM
Hi @isoardi , Seeing sockets in the TIME_WAIT state is normal and by design while a socket is being closed. Unless we see tens of thousands of sockets in TIME_WAIT, which would exhaust the ephemeral ports on the host, these are fine. It is the CLOSE_WAIT sockets we need to check, as they indicate the application has not called close() on the socket. You can refer to the Red Hat documentation below for more info on this and for ways to close TIME_WAIT sockets by reusing them. https://access.redhat.com/solutions/24154
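A quick way to see how many sockets are in each of these two states is to count them straight from /proc/net/tcp, where the state field is hex (06 = TIME_WAIT, 08 = CLOSE_WAIT); this sketch covers IPv4 only:

```shell
# Count TIME_WAIT and CLOSE_WAIT IPv4 sockets from the kernel's TCP table.
# Field 4 of /proc/net/tcp is the connection state in hex.
awk 'NR > 1 { count[$4]++ }
     END   { printf "TIME_WAIT=%d CLOSE_WAIT=%d\n", count["06"], count["08"] }' /proc/net/tcp
```

A persistently growing CLOSE_WAIT count points at the application leaking sockets, whereas a TIME_WAIT count in the low thousands on a busy host is usually harmless.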
09-15-2022
12:11 AM
Hi @abdebja , You can refer to the instructions in the Cloudera article below to mitigate this issue. https://my.cloudera.com/knowledge/tmp-folder-filling-up-frequently-with-hprof-dump-files?id=340673 - Was your question answered? Please take some time to click on “Accept as Solution” below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.