Member since: 12-20-2022
Posts: 88
Kudos Received: 19
Solutions: 9
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 515 | 05-08-2025 06:27 AM |
| | 553 | 04-02-2025 11:35 PM |
| | 451 | 03-23-2025 11:30 PM |
| | 614 | 03-06-2025 10:11 PM |
| | 1284 | 10-29-2024 11:53 PM |
04-22-2024
05:51 AM
@ryu
1. Check the queue the job is running in and see whether it has enough resources allocated.
2. Check whether the queue is showing pending containers.
3. If all of that looks fine, check locality: is the job running node-local or rack-local?
4. Then go down to the NodeManager level and debug for local Unix-level slowness.
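A minimal sketch of commands that can help with steps 1–3, assuming the YARN CLI is available on a gateway host; `root.default` and `<application_id>` are placeholders for your queue and job:

```bash
# Check queue capacity and current usage (replace with your queue name)
yarn queue -status root.default

# Check the application's progress, allocated resources, and diagnostics
yarn application -status <application_id>

# List NodeManagers with their state to spot an unhealthy or overloaded node
yarn node -list -all
```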
04-21-2024
10:04 PM
@yagoaparecidoti If my suggestion helped, please accept it as a solution.
04-21-2024
10:02 PM
As a rule of thumb, assign 80% of the node's resources to YARN. Please go through this to adjust the configuration to your needs: https://hadoop.apache.org/docs/r2.7.2/hadoop-yarn/hadoop-yarn-site/CapacityScheduler.html#Queue_Properties
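As a rough illustration of the 80% rule of thumb, this sketch computes candidate values for the standard NodeManager resource settings in yarn-site.xml (yarn.nodemanager.resource.memory-mb and yarn.nodemanager.resource.cpu-vcores) from the node's actual memory and cores; treat the output only as a starting point and adjust for other daemons running on the host:

```bash
# Compute ~80% of this node's memory (MB) and vcores for YARN containers
total_mb=$(free -m | awk '/^Mem:/ {print $2}')
total_vcores=$(nproc)
echo "yarn.nodemanager.resource.memory-mb  ~ $(( total_mb * 80 / 100 ))"
echo "yarn.nodemanager.resource.cpu-vcores ~ $(( total_vcores * 80 / 100 ))"
```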
04-16-2024
11:59 PM
1 Kudo
Hi @mike_bronson7 Please check whether the ResourceManager is working fine or is down. Please also check your ZooKeeper. Also check with telnet whether you can connect to the RM host from another host.
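A minimal sketch of those checks, assuming the ResourceManager host is `rm-host` on the default client port 8032, ZooKeeper is `zk-host` on port 2181, and (if HA is enabled) the rm-ids are `rm1`/`rm2`; adjust hosts and ports to your cluster:

```bash
# Is the ResourceManager reachable on its client port?
telnet rm-host 8032            # or: nc -vz rm-host 8032

# Is ZooKeeper healthy? An "imok" reply means it is serving requests.
echo ruok | nc zk-host 2181

# If ResourceManager HA is enabled, check which RM is active/standby.
yarn rmadmin -getServiceState rm1
yarn rmadmin -getServiceState rm2
```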
04-16-2024
11:55 PM
1 Kudo
There could be multiple causes of this issue, but first please check the permissions of the remote log directory for that NodeManager.

The NodeManager enforces that the remote root log directory exists and has the correct permission settings. A warning is emitted in the NodeManager logs if the directory does not exist, or exists with incorrect permissions (the expected setting is 1777). If the NodeManager creates the directory, its owner and group will be the NodeManager's user and group. The group is configurable, which is useful when the Job History Server (JHS) runs in a different UNIX group than the NodeManager, since that can prevent aggregated logs from being deleted.

Under the remote root log directory, each user has their own directory. Everything under a user's directory is created with 0770 permissions, so only that user and the hadoop group are allowed to access those directories and files. Each individual aggregated log file is created with 0640 permissions, giving read/write access to the user and read-only access to the hadoop group. Since the directory has 0770 permissions, members of the hadoop group can delete these files, which is important for automatic deletion.
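A minimal sketch of how to inspect and fix the directory, assuming the default remote root log directory /tmp/logs on HDFS and a NodeManager running as user `yarn` in group `hadoop` (adjust the path if yarn.nodemanager.remote-app-log-dir is customized):

```bash
# Inspect the current owner, group, and permissions of the remote root log dir
hdfs dfs -ls -d /tmp/logs

# Create it if it is missing, then set the expected ownership and mode
hdfs dfs -mkdir -p /tmp/logs
hdfs dfs -chown yarn:hadoop /tmp/logs
hdfs dfs -chmod 1777 /tmp/logs
```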
02-13-2024
01:17 AM
1 Kudo
Do this to fix your issue: reference an environment variable in the config instead of hard-coding the path. Note that '~/' is bash tilde expansion.

1. In conf.yml, add the ${dir_for_config} variable to the H2 URL:

        dbConfig:
          dbType: H2
          driver: org.h2.Driver
          url: jdbc:h2:~/${dir_for_config}/config-service;DB_CLOSE_DELAY=-1;AUTO_RECONNECT=TRUE;DB_CLOSE_ON_EXIT=FALSE

2. In the CM web UI, go to CM > YARN Queue Manager > Configuration, find "YARN Queue Manager Service Environment Advanced Configuration Snippet (Safety Valve)", and add:
   - key: dir_for_config
   - value: x
02-13-2024
01:08 AM
1 Kudo
Please list the users separated by a comma followed by a space, e.g. yarn, hdfs, mapred.
02-12-2024
03:40 AM
1 Kudo
Yes, you can use it, but the problem is that you would have to allow all users to access that directory. YARN can move local logs securely onto HDFS or cloud-based storage, such as AWS S3. This allows the logs to be stored for much longer than they could be on a local disk, allows faster searching for a particular log file, and can optionally handle compression.
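For reference, a minimal sketch of how aggregated logs are typically read back, assuming log aggregation is enabled (yarn.log-aggregation-enable=true) and the default remote directory /tmp/logs; `<application_id>` is a placeholder for your job's ID:

```bash
# Fetch the aggregated logs for one application from HDFS
yarn logs -applicationId <application_id> | less

# See how much space aggregated logs are using per user directory
hdfs dfs -du -h /tmp/logs
```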
02-08-2024
04:02 AM
2 Kudos
The application asks for containers, runs part of the work in each container, and then releases it back; the 28 vcores you are seeing is the total allocated over the life of the job, not the concurrent usage. Say your job asks for 4 containers, each with 7 vcores. At first only two containers run, because you have a limit of 15 vcores. When one container is released, the job takes another container with 7 vcores, so the total number of vcores allocated so far becomes 21, and eventually 28 once the fourth container runs.
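A toy loop, just to illustrate the cumulative accounting described above (this is not a YARN command, only the arithmetic):

```bash
# 4 containers of 7 vcores each, started and released one after another
total=0
for vcores in 7 7 7 7; do
  total=$((total + vcores))
  echo "container allocated: ${vcores} vcores, cumulative allocated: ${total}"
done
# prints 7, 14, 21, 28 -- matching the figure reported for the whole job
```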