Member since
02-18-2019
83
Posts
3
Kudos Received
0
Solutions
03-26-2021
04:25 AM
Hello, Need some assistance / guidance on how we can reduce Non-HDFS Space. We see Non-HDFS Space of around 270 used, as we are facing space crunch, we would explore possibilities for getting non-hdfs space reduced. I have cleared all Yarn logs for the applications which were killed/ failed etc (our /data mountpoint houses dfs, yarn, kudu, impala), yet this does not solve our issue. Any assistance / guidance is much appreciated. Thanks Amn
... View more
Labels:
03-23-2021
09:17 PM
Hello, We are getting alerts for Block Count on one of our data nodes as it has crossed the threshold of 10000. Since HDFS balancer did not fix the issue, the next thing I turned my focus to see if we are hitting small files issue. I was trying to put up a report via terminal script ( hdfs dfs -ls -R /tmp |grep ^- |awk '{if ($5 < 134217728) print $5, $8;}'| head -5 | column –t) but when I compare the result from the script output vs HDFS Report from Cloudera Manager I see a difference in the size of the same file. Could anyone provide any guidance / assistance on this, or am I doing something wrong. Thanks Amn
... View more
Labels:
03-16-2021
03:03 AM
Thanks @tjangid Just one doubt does it matter if we keep two dash (--backend_client_rpc_timeout_ms ) or (-backend_client_rpc_timeout_ms ), please confirm. Thanks Amn
... View more
03-15-2021
10:26 AM
Hello, I need to increase the query timeout (backend_client_rpc_timeout_ms) from current 5 minutes to 30 minutes. Could anyone guide me from where in Impala Configurations I can do this change. I did some checking but cannot find any related, appreciate any assistance / guidance. Thanks Amn
... View more
Labels:
03-08-2021
06:16 PM
Hello, We are seeing concerning alert on one of our data node related to File Descriptor (Concerning: Open file descriptors: 16,410. File descriptor limit: 32,768. Percentage in use: 50.08%. Warning threshold: 50.00%.) Would appreciate any help/ guidance to fix this before it goes out of hand. [user1@myserver ~]$ ulimit -a core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 1030544 max locked memory (kbytes, -l) 64 max memory size (kbytes, -m) unlimited open files (-n) 1024 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) 8192 cpu time (seconds, -t) unlimited max user processes (-u) 4096 virtual memory (kbytes, -v) unlimited file locks (-x) unlimited [user1@myserver ~]$ cat /proc/sys/fs/file-max 26161091 [user1@myserver ~]$ cat /proc/sys/fs/file-nr 80400 0 26161091 Thanks Amn
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Cloudera Manager
-
HDFS
11-10-2020
10:59 PM
Hello @PabitraDas, Appreciate your assistance, below is the block count on our DNs, as mentioned earlier we have allocated 6 GB JVM Heap for DN's and 10 GB Heap for NN & SNN. Do you suggest to increase DN Heap, or NN / SNN Heap as suggested by Shelton. Block Count: Node 1 = 7421379 Node 2 = 5569699 Node 3 = 6003009 Node 4 = 7444205 Node 5 = 8770674 Node 6 = 8849641 Node 7 = 8232779 Node 8 = 8354714 Node 9 = 8860602 Also, would greatly appreciate if you have any pointers / suggestions (scripts etc. ) to identify small file issue and possible remediation. Thanks Amn
... View more
11-05-2020
07:33 PM
@Shelton Apologies for the delay in replying. For my understanding, if possible, would you please explain how increasing NN Heap would fix DN Pause duration. Thanks in advance Amn
... View more
10-27-2020
12:41 AM
Hello @GangWar @Shelton Appericate your assistance, Following is the information available from NN WebUI- (23,326,719 files and directories, 22,735,340 blocks = 46,062,059 total filesystem object(s). Heap Memory used 5.47 GB of 10.6 GB Heap Memory. Max Heap Memory is 10.6 GB. Non Heap Memory used 120.51 MB of 122.7 MB Commited Non Heap Memory. Max Non Heap Memory is <unbounded>.) Could you please re-confirm whether I need to adjust the NN Heap Memory OR DN heap memory, as the issue is seen on data Node and that too only one data node other 8 seem to be running without any issues. Thanks Amn
... View more