Member since: 10-29-2015
Posts: 125
Kudos Received: 31
Solutions: 4
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 1030 | 06-27-2024 02:42 AM
 | 2487 | 06-24-2022 09:06 AM
 | 3904 | 01-19-2021 06:56 AM
 | 56436 | 01-18-2016 06:59 PM
06-26-2024
04:06 AM
Hello,

We are on CDP PB 7.1.9, and the goal is to monitor YARN applications and performance, along with a few HDFS metrics, on Grafana dashboards and ultimately trigger alerts. The metrics have to be collected via a Prometheus agent and shared with Grafana.

At this point, I have downloaded the Prometheus agent and created the yml configuration below. I understand this will collect all the metrics, which is intentional to start with.

lowercaseOutputName: true
rules:
  # All Gauge type Hadoop Metrics
  - pattern: 'hadoop<name=.*><>(Count|Value)'
    name: hadoop_${1}_gauge
    type: GAUGE
  # All Counter type Hadoop Metrics
  - pattern: 'hadoop<name=.*><>(Count|Value)'
    name: hadoop_${1}_counter
    type: COUNTER

The Prometheus agent jar and this config file are stored at /var/lib/prometheus_jmx_config/. For now I am testing this only on the Resource Manager instances to verify that metrics are getting collected.

Following a few articles and the Grafana documentation, I understood that I will need to run the exporter either as a Java agent (our case) or as a standalone HTTP server. To achieve this, we need to expose / enable JMX for the components we want to monitor (Resource Manager in this case), which has to be done by adding the java agent option in hadoop-env.sh. Referring to the documentation, below is the option I think should work:

YARN_RESOURCEMANAGER_OPTS="-javaagent:/var/lib/prometheus_jmx_config/jmx_prometheus_javaagent-0.20.0.jar=9091:/var/lib/prometheus_jmx_config/hadoop_jmx_exporter.yml"

I tried adding this in the Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hadoop-env.sh in the YARN configuration, and it prompted me to restart the impacted services, since this would change yarn-conf for multiple services. However, the YARN service (the RM, to be specific) was never restarted automatically, so I restarted it myself, expecting it to run the Prometheus agent as part of the Java process and ultimately enable JMX on port 9091. However, the agent never started, so JMX did not get enabled.

We also have a few Java-related properties specific to components, such as:

Java Configuration Options for NodeManager
Java Configuration Options for ResourceManager

However, as I am not confident whether those are the right place for this, it would be great if someone could advise on the same, or on any configuration that I may have missed.

Thanks
Snm1523
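For context, my understanding is that once the java agent does start, it exposes an HTTP /metrics endpoint on the port given in the option (9091 here), and the Prometheus server then needs a scrape job pointing at that port. A minimal sketch of what I expect that scrape job to look like, with a hypothetical Resource Manager hostname rm01.example.com:

# prometheus.yml on the Prometheus server - scrape section only, hostname is hypothetical
scrape_configs:
  - job_name: 'yarn-resourcemanager-jmx'
    scrape_interval: 30s
    static_configs:
      - targets: ['rm01.example.com:9091']   # port set in the -javaagent option above

Hitting http://rm01.example.com:9091/metrics with curl first should confirm whether the agent is actually up before wiring it into Prometheus.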
06-21-2024
07:27 AM
1 Kudo
Hello @wbekker, Which one was it, and how did you turn it off? Thanks snm1523
06-07-2024
01:19 AM
1 Kudo
UPDATE: Some configuration in the topology files had been done incorrectly, which is why I was unable to access the service UIs. Fixing the topology config file helped.
06-05-2024
02:30 AM
1 Kudo
Thank you for the detailed explanation, @ShankerSharma. In the end, the engineering team, along with the developers, handled this job, but I will keep this in my notes for reference.
06-05-2024
02:26 AM
1 Kudo
Hello @Dennisleonn,

Thank you for the detailed explanation and response. It certainly helped me understand the way Knox and Ranger work together.

With respect to the issue of Knox not being able to write the audit logs, I was able to get past it by changing the authorization type to "XASecurePDPKnox", which pushed Knox to use Ranger for authorization and ultimately started writing audits to HDFS.

However, I am now stuck on the next issue: I am unable to access the service URLs through Knox, as access is denied regardless of the permissions in the Ranger policies for the respective services. The same is seen in the Ranger Admin UI, which confirms Ranger is denying access to the service UIs via the custom topology. All works okay with the default (cdp-proxy) topology.

I am pretty sure something basic has been missed, but I am unable to get hold of it. Any clue on this?

Thanks snm1523
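For reference, if I understand the Knox-Ranger integration correctly, the change above corresponds to the authorization provider entry in the custom topology. A minimal sketch of just that provider block (the rest of the topology and the actual service entries are omitted):

<provider>
    <!-- authorization handled by the Ranger Knox plugin instead of Knox ACLs -->
    <role>authorization</role>
    <name>XASecurePDPKnox</name>
    <enabled>true</enabled>
</provider>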
05-23-2024
04:09 AM
Hello,

Has anyone encountered an issue where Knox is not writing audit logs for specific topologies? We have the topologies below, including a few migrated from HDP; the necessary modifications were made and they are listed in the Knox UI.

cdp-proxy
cdp-proxy-api
cdp-proxy-token
health
tokenexchange
user1 - created for a user group
topo1 - created for a user group and migrated from HDP
topo2 - created for a user group and migrated from HDP
app - used by app accounts

Knox is successfully writing Ranger audit logs to HDFS only for the cdp* topologies, which were created by Cloudera during setup of the Knox service, and not for the others. The written logs are visible in the Access tab of the Audit section in the Ranger Admin UI.

We have a total of 3 clusters, and this is the case on 2 of them; on 1 cluster everything works fine. I have compared the configuration and also the topology XMLs, and all seems correct (except for instance details, which is expected).

Could it be anything related to the Ranger or Solr configuration for Knox? But if that were the case, it should apply to all Knox topologies, so why only the non-default ones?

Please help with suggestions / things to check / troubleshoot.

Thanks snm1523
05-20-2024
03:53 AM
1 Kudo
Hello @Scharan, I did find this some time back. 🙂 Thank you for reconfirming my understanding, though. Thanks snm1523
05-20-2024
01:34 AM
1 Kudo
Hello,

We have recently migrated from HDP to CDP 7.1.9, and the new Knox topologies created per the Cloudera recommendations (cdp-proxy, cdp-proxy-api and cdp-proxy-token) are in place and configured. However, for some reason the topologies from the HDP environment have also been inherited and are visible in the Knox UI, though they are not active / used. Kindly advise on the points below:

1. Is it normal behaviour for Knox topologies to be inherited, or did we miss something?
2. How do we delete these unwanted topologies so they don't appear in the Knox UI?
3. There are some topologies that we would still need; however, a few services like Ambari and Zeppelin are not needed. How do I remove those from an existing topology?

Thanks snm1523
05-16-2024
02:35 AM
1 Kudo
Thank you for the explanation, @Rajat_710. This helps. So once the logs in spool are processed, they are moved to archive and are therefore safe to delete, correct? Secondly, for SMM, the logs are not going to HDFS at all. Is there a way to configure SMM audit logs to go to HDFS like we are doing for the other services? I did not see a configuration anywhere in SMM to enable this. Thanks snm1523
05-15-2024
01:12 AM
1 Kudo
Hello,

Any clue on how we can configure the Streams Messaging Manager server to send audit logs to HDFS and / or Solr (just like the other services), so that they then get archived to the /archive directory, from where we can manually delete them?

I am referring to the logs that get stored locally (not in HDFS) under the location below:

/var/log/streams-messaging-manager/audit/<hdfs or solr>/spool

Thanks snm1523