About Babasaheb

Babasaheb · ‎08-16-2022

@Sagar_cloudera Looks like the host has network connectivity issues. Please check if there are any network connectivity issues if not, then provide us the complete details/error you are getting while installing Cloudera so that we can suggest you on it further. Thank you.

Babasaheb · ‎08-10-2022

@yagoaparecidoti You can add these properties as additional properties in “HDFS Client Configuration Safety Valve for hdfs-site.xml”. dfs.client.block.write.replace-datanode-on-failure.policy=ALWAYS dfs.client.block.write.replace-datanode-on-failure.best-effort=true If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Babasaheb · ‎08-07-2022

@cfoo I see the following in the Nimbus log: [ERROR] Error when processing event org.apache.storm.generated.KeyNotFoundException: null Could you please try the below action plan and let us know how it goes: - Increase the JVM of the nimbus from the existing value to keep the nimbus up for a few additional moments to kill the topologies. - stop all the topologies - delete the znode /storm ( From zk cli : rmr /storm ) - take a backup of storm.local.dir and delete the same( From zk cli : rm -rf <value of stormlocal.dir> ). You can check it in the storm.yaml file - start the supervisors and nimbus, check if the services are up and running - If there is no Nimbus service running, then the only way to remove the topology processes would be to manually kill the worker processes on all worker nodes. If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Babasaheb · ‎08-07-2022

@Onkar_Gagre I would suggest you to tweak the following settings for better MirrorMaker scaling: 1) Kafka > Config > consumer.request.timeout.ms to 500000 Kafka > Config > session.timeout.ms to 60000 Kafka > Config > buffer.memory to 66000000 Kafka > Config > replica.fetch.max.bytes to 56700160 Kafka > Config > message.max.bytes to 56700160 2) You can increase the heartbeat.interval.ms to 20 secs in line with session.timeout.ms (1/3rd of session time). This has to be set in CM > Kafka > Config > Kafka MirrorMaker Advanced Configuration Snippet (Safety Valve) for mirror_maker_consumers.properties > heartbeat.interval.ms = 20000 > Save changes > Restart MM. 3) Add partition.assignment.strategy=org.apache.kafka.clients.consumer.RoundRobinAssignor to Kafka safety value for mirror.maker.consumer.properties If you still face the same issue, then provide us the following details for further investigation: 1) Are you getting any errors in the broker logs and MM logs? OR we need broker and MM logs 2) Run in each broker on the destination cluster a couple of times every 5 hours below to discard networking issues: netstat -s | egrep "retransmited|sent out|socket|buffer|overflow|total packets received" 3) How many brokers are there in the Kafka cluster? 4) Was it working fine before and have you made any recent changes? Thank you.

Babasaheb · ‎08-07-2022

@radha56 We need some more details to check and investigate this issue further: 1) Provide us the stdout and stderr you are getting while restarting broker from Ambari >> Kafka 2) Complete ERROR trace from the log file from failed broker node 3) Alert/WARN you are getting in Ambari >> Kafka >> Status page Thank you.

Babasaheb · ‎08-07-2022

@Onkar_Gagre A] There is no any standard tool that Kafka or Cloudera provides to migrate only Kafka zookeeper metadata. But yes, there are some third-party tools which you can use to migrate zookeeper metadata. You can use a third-party tool like zkcopy to migrate/copy metadata from one cluster to another. But we would recommend testing it in your dev lab first to avoid further issues Note: Just something to be aware of.. If you decide on copying over the znodes uses this third party tool then you need to do this when the Kafka service is shutdown. You would then copy it to the new Znode root location, change the chroot config in Kafka and start that up again. B] You can use SRM for migration and once the migration is completed you can lower the retention settings 1-2 hours as you don't need data so that all data will be cleared automatically and then you can set retetion back to 7 days which is default or as per your requirement. C] You can also create all topics on a new cluster with the required configuration If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Babasaheb · ‎08-07-2022

@CDPTester You can use camel-snmp-kafka-connector to receive traps and poll SNMP (Simple Network Management Protocol) and configure it to send to Kafka topic [1]. [1] https://camel.apache.org/camel-kafka-connector/0.11.x/reference/connectors/camel-snmp-kafka-source-connector.html This connector will be not bundled with the CDP but you can integrate this Apache open source connector as per your requirement. Refer the following stackoverflow article for more details/example [2] [2] https://stackoverflow.com/questions/54305856/snmp-camel-kafka If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Babasaheb · ‎08-06-2022

@Venkatd As per the initial description, it looks like you have installed the required components manually and trying integrate them but you are getting the following error: ERROR: Exception in thread "main" java.lang.NoClassDefFoundError Kindly note that the above exception usually occurs when the dependencies in the classpath are missing that are required for the script/component. First thing please very kafka-client jar (for example kafka-client-2.13-2.5.0.jar) exists in the classpath. If yes, then it requires "commons-lang" jar/dependencies in your classpath. If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Babasaheb · ‎08-06-2022

@yagoaparecidoti A] Generally /config/topics/ will store all topic level configuration if you have set any. For example: If you have set any topic level setting then it will be stored there: [zk: node1.zookeeper-hostname.com:2181(CONNECTED) 6] get /kafka/config/topics/ATLAS_HOOK {"version":1,"config":{"retention.ms":"2678400000","cleanup.policy":"compact"}} [zk:node1.zookeeper-hostname.com:2181(CONNECTED) 7] Here topic level settings are retention.ms and cleanup.policy=compact As it has only topic-level configuration details it will not affect the Kafka cluster or applications if applications are not dependent on any topic-level settings. B] There is no standard Kafka tool which you can use to restore /config/topics but you can easily alter/set topic level settings using the Kafka command below as per your retirement if any. kafka-configs --zookeeper <ZK_host>:2181/kafka --entity-type topics --entity-name <topic_name> --alter --add-config <configname & value> Note: Replace all required settings and /kafka (/kafka can be checked from CM >> Kafka >> Conf >> zookeeper.chroot) in the above command If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post.

Babasaheb · ‎08-06-2022

@Jarinek Please check my inline answer to your queries: 1) is it possible to alter compression.type on a kafka topic while having applications on the topic running? >> Yes, we would recommend stopping the producer and then alert compression type as whenever the Kafka broker gets a compressed batch of messages from a producer. It will decompress the data to validate if the compression type is different at the topic level and producer level and if compression is the same from both sides, then it will write directly to the topic/leader partition. 2) What should be expected hard drive saving factor for text records like json (2KB size) and compression.type=gzip. >> You should use and test snappy or lz4 for best speed/compression ratio 3) Does it apply to KafkaMirrorMaker only? >> Yes, as per the screenshot it will apply for the MM only. 4) Are producer and consumer applications somehow affected, when the global parameter changes. >> If you are changing it from the Kafka side, then it should not affect the applications. If you found this response assisted with your query, please take a moment to log in and click on KUDOS 🙂 & ”Accept as Solution" below this post. Thank you.

Online	Offline
Last Visited	‎08-12-2025 07:34 PM

Member Since	‎09-01-2020 02:38 AM
Last Visited	‎08-12-2025 07:34 PM
Posts	321
Kudos received	21

Cloudera Community

Re: Kafka Broker not starting after enabling range...

Re: How to read hexadecimal escape sequences from ...

Re: Hdfs to confluent kafka

Re: If CDP Private Cloud Data Services is installe...

Re: Using HDFS as local storage for yarn cluster d...

Re: getting error when deploying cloudera on gcp

Re: how to configure fs.client.block.write.replace...

Re: Storm Nimbus fail to start with error org.apac...

Re: Mirrormaker 2.0 delay and how to identify bott...

Re: Ambari metrics is showing stopped kafka broker...

Re: How to copy topic configuration (only metadata...

Re: how to run Kafka console producer in a remote ...

Re: Unable to integrate kafka and hbase into atlas...

Re: URGENT - accidentally deleted /config/topics d...

Re: questions about kafka and compression.type