Member since: 09-01-2020
Posts: 321
Kudos Received: 24
Solutions: 10
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2136 | 10-22-2024 11:56 AM |
| | 3444 | 09-23-2024 11:55 PM |
| | 3495 | 09-23-2024 11:35 PM |
| | 1714 | 03-04-2024 07:58 AM |
| | 3374 | 11-15-2023 07:50 AM |
02-06-2025
12:37 AM
Hello @thoufeeq1218,

We understand that you have configured spark.blockManager.driver.port and spark.blockManager.port, but Spark may still attempt to use random ports for the following reasons:

- Why is Spark using random ports? Spark uses additional ports beyond the BlockManager for communication between the driver and executors. The random port you observed (59698) is from the ephemeral port range (1024–65535) and could be assigned via spark.driver.port (default: random) or spark.executor.port (default: random).
- How to restrict Spark to specific ports: Explicitly set spark.driver.port so the driver listens on a fixed port, e.g. --conf spark.driver.port=21800. Also ensure that the port set in spark.blockManager.port is available; if 21700 is occupied, Spark will fall back to a random port.
- Understanding spark.port.maxRetries: If spark.port.maxRetries > 0 (default: 16), Spark will try that many successive ports above the configured one. If spark.port.maxRetries = 0, Spark will fail immediately when the specified port is unavailable.
- Executors and dynamic ports: Executors start and stop dynamically and may request random ports.

If you must prevent Spark from using random ephemeral ports, use the following settings:

--conf spark.driver.port=21800
--conf spark.blockManager.driver.port=21750
--conf spark.blockManager.port=21700
--conf spark.executor.port=21810
--conf spark.port.maxRetries=0

These settings can be applied at the job level or in the Spark configuration file. Note: if a port is already in use by a running job, a new job may fail due to a port conflict.

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
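For reference, the settings above can be combined into a single spark-submit invocation. This is only a sketch: the port numbers follow the example values in this thread, and the main class and jar path are placeholders for your own application.

```shell
# Sketch: pin Spark's driver, block manager, and executor ports so that
# no ephemeral ports are used. Port numbers are the example values from
# this thread; the class name and jar path are placeholders.
spark-submit \
  --conf spark.driver.port=21800 \
  --conf spark.blockManager.driver.port=21750 \
  --conf spark.blockManager.port=21700 \
  --conf spark.executor.port=21810 \
  --conf spark.port.maxRetries=0 \
  --class com.example.MyApp \
  /path/to/your-app.jar
```

With spark.port.maxRetries=0 the job fails fast on a port conflict instead of silently moving to another port, which makes firewall rules predictable.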
10-22-2024
11:56 AM
2 Kudos
Hello @amru,

We need the complete log file or a relevant log snippet to understand why Kafka is failing after enabling Ranger. In the meantime, you should consider checking the following:

- Check Ranger's audit logs to see whether any access requests were denied; this can help you pinpoint the issue and the specific resource where permissions are lacking.
- Check the Ranger logs for any errors or issues.
- Ensure that the Kafka policies are synced successfully.
- If you are using AD/LDAP, ensure that all users are properly synced with Ranger.

Thank you.
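If audits are also spooled to local files on the broker hosts, a quick grep can surface denied requests. This is a sketch under assumptions: the audit file path varies by deployment (audits are often stored in Solr and browsed from the Ranger Admin UI instead), and the "result":0 field meaning "denied" reflects the common Ranger audit JSON layout.

```shell
# Sketch: look for denied access events in local Ranger audit files.
# The path is an assumption and varies by deployment; in many clusters
# audits live in Solr and are viewed from the Ranger Admin UI instead.
# In Ranger audit JSON, "result":0 commonly indicates a denied request.
grep -rh '"result":0' /var/log/kafka/audit/ | tail -n 20
```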
09-25-2024
06:02 AM
1 Kudo
Hello @Israr , You should verify that both Kafka and Kafka Connect are running and in a healthy state. Thank you.
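As a quick health check, the Kafka Connect REST interface can be queried directly. This is a sketch: the hostname is a placeholder, port 8083 is the common default and may differ in your deployment, and a secured cluster will need TLS/authentication options on the curl calls.

```shell
# Sketch: query the Kafka Connect REST API (default port 8083).
# Hostname is a placeholder; add TLS/auth options on secured clusters.

# Root endpoint returns the worker version and the Kafka cluster id.
curl -s http://connect-host.example.com:8083/

# Lists the connectors currently deployed on this Connect cluster.
curl -s http://connect-host.example.com:8083/connectors
```

If both calls respond, the Connect workers are up and reachable; an empty connector list or a connection refusal narrows down where the problem is.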
09-23-2024
11:55 PM
1 Kudo
Hello @ayukus0705,

A] "I am looking for an option where we can directly read those hexadecimal escape sequences (i.e., ReportV10\x00\x00\x00\x00\x02\x02\x02) as-is into my Spark dataframe."
>> You will have to make sure that the escape sequences are treated as raw binary data or strings, without any automatic decoding or transformation. Following is an example that reads the file as binary:

val df = spark.read.format("binaryFile").load("path of your file here")

B] Alternatively, you can use the HBase Spark connector to load the data as binary. When using the HBase Spark connector, no automatic decoding or transformation into another format is applied. Refer to the following docs for more details:

Private Cloud: https://docs.cloudera.com/cdp-private-cloud-base/7.1.9/accessing-hbase/topics/hbase-example-using-hbase-spark-connector.html
Public Cloud: https://docs.cloudera.com/runtime/7.2.18/accessing-hbase/topics/hbase-using-hbase-spark-connector.html

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
09-23-2024
11:35 PM
2 Kudos
Hello @Israr,

Cloudera provides straightforward options to configure such a setup through Cloudera Manager (CM) and Streams Messaging Manager (SMM) [1]. You can configure this pipeline with the HDFS or Stateless NiFi Source and Sink connectors [2] [3].

[1] https://docs.cloudera.com/cdp-private-cloud-base/7.1.8/monitoring-kafka-connect/topics/smm-creating-a-connector.html
[2] Public Cloud: https://docs.cloudera.com/runtime/7.2.18/kafka-connect/topics/kafka-connect-connector-nifi-stateless.html
[3] Private Cloud: https://docs.cloudera.com/cdp-private-cloud-base/latest/kafka-connect/topics/kafka-connect-connector-nifi-stateless.html
HDFS: https://docs.cloudera.com/cdp-private-cloud-base/7.1.9/kafka-connect/kafka-connect.pdf

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
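Besides the SMM UI, a connector can also be deployed through the Kafka Connect REST API. This is only a sketch: the connector class and every property value below are placeholders; take the real class name and configuration keys from the Cloudera connector documentation linked above or from the SMM "New Connector" form.

```shell
# Sketch: deploy a sink connector via the Kafka Connect REST API.
# All names and values are placeholders; use the real connector class
# and properties from the Cloudera docs or the SMM connector form.
curl -s -X POST http://connect-host.example.com:8083/connectors \
  -H 'Content-Type: application/json' \
  -d '{
        "name": "example-sink",
        "config": {
          "connector.class": "<HDFS or Stateless NiFi sink connector class>",
          "topics": "my-topic",
          "tasks.max": "1"
        }
      }'
```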
03-06-2024
02:58 AM
1 Kudo
Hello @BrianChan,

For such issues, we should check the health of the consumer offsets topic (__consumer_offsets) using the Kafka describe command, and check the min.insync.replicas setting of this topic in the describe output. It should be less than or equal to the topic's ISR count. For example, if the topic has a replication factor of 3, then min.insync.replicas should be 2 (or 1) to allow failover.

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
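The check above can be run with the kafka-topics CLI. This is a sketch: the broker address is a placeholder, and a secured cluster needs an additional client properties file passed via --command-config.

```shell
# Sketch: inspect replication factor, ISR, and min.insync.replicas for
# the internal consumer offsets topic. Broker address is a placeholder.
kafka-topics --bootstrap-server broker1.example.com:9092 \
  --describe --topic __consumer_offsets
```

In the output, compare the Isr list length per partition against the topic's min.insync.replicas (shown under Configs); partitions whose ISR count drops below it will reject acks=all produces, including consumer offset commits.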
03-06-2024
02:44 AM
2 Kudos
Hello @hegdemahendra,

1) Please refer to the following article on connecting to Kafka from NiFi: https://community.cloudera.com/t5/Community-Articles/Integrating-Apache-NiFi-and-Apache-Kafka/ta-p/247433
2) Also, to isolate the issue, you can try connecting to Kafka with the same settings from the NiFi node using the Kafka command-line tools.

Please let us know if you still have any questions or are facing any issues; we will be happy to assist you.

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
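Step 2 can be done with the console producer and consumer shipped with Kafka. This is a sketch: the broker address and topic name are placeholders, and a secured cluster needs a client properties file (e.g. via --producer.config / --consumer.config) matching the NiFi processor's security settings.

```shell
# Sketch: verify broker connectivity from the NiFi node using the Kafka
# CLI tools. Broker address and topic are placeholders; on a secured
# cluster, pass the same security settings NiFi uses via a config file.

# Type a few messages, then Ctrl+C:
kafka-console-producer --bootstrap-server broker1.example.com:9092 \
  --topic test-topic

# Read them back:
kafka-console-consumer --bootstrap-server broker1.example.com:9092 \
  --topic test-topic --from-beginning
```

If the CLI works from the NiFi node but the processor does not, the problem is in the NiFi processor configuration rather than in network reachability or broker health.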
03-04-2024
07:58 AM
1 Kudo
Hello @steinsgate,

CDP Private Cloud Data Services uses a dedicated OCP cluster only, so it does not affect other services.

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you.
12-29-2023
12:48 AM
1 Kudo
Hello @StanislavJ,

The Linux kernel parameter vm.swappiness is a value from 0-100 that controls the swapping of application data (as anonymous pages) from physical memory to virtual memory on disk. The higher the value, the more aggressively inactive processes are swapped out of physical memory; the lower the value, the less they are swapped, forcing filesystem buffers to be emptied.

On most systems, vm.swappiness is set to 60 by default. This is not suitable for Hadoop clusters, because processes are sometimes swapped even when enough memory is available. That can cause lengthy garbage collection pauses for important system daemons, affecting stability and performance. Cloudera recommends setting vm.swappiness to a value between 1 and 10, preferably 1, for minimum swapping on systems where the RHEL kernel is 2.6.32-642.el6 or higher.

To view your current setting for vm.swappiness, run:
cat /proc/sys/vm/swappiness
To set vm.swappiness to 1, run:
sudo sysctl -w vm.swappiness=1

To give an overview of the alerting side: swapping alerts are generated in Cloudera Manager when host swapping or role process swap usage exceeds a defined threshold. A Warning threshold of "500 MiB" means that any swap usage beyond this on a given host generates a warning alert; a Critical threshold of "Any" generates a critical alert even if a small amount of swapping occurs. The swap memory usage threshold can be set at the host level or at the process/service level.

To set the threshold at the process level:
From the CM UI >> Clusters >> YARN >> Configuration >> search for "Process Swap Memory Thresholds" >> (for the ResourceManager) set Warning and Critical >> select Specify >> enter the value (in Bytes/KB/MB/GB) >> Save Changes.

You can increase the value and then monitor the cluster's swap usage, adjusting the thresholds accordingly.
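Note that sysctl -w changes the value only until the next reboot. To make the setting persistent (a standard Linux step, not Cloudera-specific; the drop-in file name is just a convention), write it to a sysctl configuration file:

```shell
# Make the swappiness setting survive reboots. The drop-in file name
# under /etc/sysctl.d/ is a convention; any *.conf name works.
echo 'vm.swappiness=1' | sudo tee /etc/sysctl.d/99-swappiness.conf

# Apply the file immediately without rebooting.
sudo sysctl -p /etc/sysctl.d/99-swappiness.conf
```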
If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you, Babasaheb Jagtap
11-15-2023
07:50 AM
1 Kudo
Hello @one4like,

Pushing every local file of a job to HDFS would cause issues, especially in larger clusters. Local directories are used as a scratch location: mapper spills are written there, and moving that traffic over the network would have performance impacts. Scratch and shuffle files are stored locally precisely to prevent this. It also has security impacts, as the NodeManager would then push the keys for each application onto a network location that could be accessible to others.

A far better solution is to use the fact that yarn.nodemanager.local-dirs can point to multiple mount points, thus spreading the load over all of them.

So the answer is NO: local-dirs must contain a list of local paths. There is an explicit check in the code that only allows the local filesystem to be used; see: https://github.com/apache/hadoop/blob/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/LocalDirsHandlerService.java#L224 Please note that an exception is thrown when a non-local file system is referenced.

If you found this response helpful, please take a moment to log in and click on KUDOS 🙂 & "Accept as Solution" below this post. Thank you. Bjagtap
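For reference, spreading the load over multiple mount points is a yarn-site.xml setting. This is a config sketch: the mount point paths are placeholders for your own disks, and each should be on a separate physical device to actually spread the I/O.

```xml
<!-- Sketch: yarn-site.xml with multiple local scratch directories.
     The paths are placeholders; use one directory per physical disk. -->
<property>
  <name>yarn.nodemanager.local-dirs</name>
  <value>/data1/yarn/nm,/data2/yarn/nm,/data3/yarn/nm</value>
</property>
```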