Member since: 11-19-2015
Posts: 158
Kudos Received: 25
Solutions: 21
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 11899 | 09-01-2018 01:27 AM |
| | 1143 | 09-01-2018 01:18 AM |
| | 3798 | 08-20-2018 09:39 PM |
| | 509 | 07-20-2018 04:51 PM |
| | 1521 | 07-16-2018 09:41 PM |
09-24-2018
06:35 PM
Hi @Zach, please see my answer on StackOverflow here: https://stackoverflow.com/a/52266219/2308683 Burrow does essentially the same thing, but in Golang. However, how you read the data and perform the lag calculations also depends on what is currently being consumed, and that information is not stored immediately within the offsets topic.
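For a quick manual check (as opposed to Burrow or reading the offsets topic directly), the stock consumer-groups tool can report per-partition lag. This is only a sketch, assuming Kafka 0.10+ and a typical HDP install path; the broker address and group name are placeholders:

```
# Sketch: show current offset, log-end offset, and lag for each partition of a consumer group.
# The script location can differ between installs.
/usr/hdp/current/kafka-broker/bin/kafka-consumer-groups.sh \
  --bootstrap-server broker1.example.com:9092 \
  --describe --group my-consumer-group
```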
09-17-2018
07:48 PM
@ssarkar Is it not possible to use Ambari to install a separate ZooKeeper host group, then configure a Kafka host group to use that secondary ZooKeeper quorum?
09-17-2018
07:45 PM
These are spam accounts, by the way. Look at all the "answers" from the other users for every question, and they all link back to dataflair's website.
09-10-2018
06:55 PM
If you are running Kafka 0.10 or newer, connect-distributed.sh exists somewhere under /usr/hdp/current/kafka already. You can run that process on multiple machines to create a Kafka Connect cluster.
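As a rough sketch (host names, paths, and the properties file location below are assumptions, not HDP defaults), every machine in the cluster runs the same worker script against a properties file that shares one group.id:

```
# Sketch: start a distributed Kafka Connect worker; repeat on each machine.
# Workers with the same group.id (set in the properties file) join the same Connect cluster.
# Adjust bootstrap.servers, group.id, and the internal topic names before starting.
/usr/hdp/current/kafka-broker/bin/connect-distributed.sh \
  /etc/kafka/conf/connect-distributed.properties
```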
09-10-2018
06:46 PM
I think you are asking about adding directories to DataNodes.
dfs.datanode.data.dir in the hdfs-site.xml file is a comma-delimited list of directories where the DataNode will store blocks for HDFS. See also https://community.hortonworks.com/questions/89786/file-uri-required-for-dfsdatanodedatadir.html
| Property | Default | Description |
|---|---|---|
| dfs.datanode.data.dir | file://${hadoop.tmp.dir}/dfs/data | Determines where on the local filesystem a DFS DataNode should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. |
Otherwise, I'm afraid your question doesn't make sense, other than running the HDFS mkdir command to "add a new directory in HDFS".
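If creating a directory inside HDFS is really all you need, a minimal sketch (the path is a placeholder):

```
# Sketch: create a directory inside HDFS itself (not on the DataNode's local disks).
hdfs dfs -mkdir -p /data/new_directory
hdfs dfs -ls /data
```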
09-04-2018
06:58 PM
@Manish Tiwari, perhaps you can look at https://docs.hortonworks.com/HDPDocuments/Cloudbreak/Cloudbreak-2.7.1/content/data-lake/index.html. Otherwise, you can search https://docs.hortonworks.com/ for the keywords you are looking for.
09-01-2018
01:36 AM
If you expose Kafka via HTTP, then I don't see the downside of exposing Kafka itself. If you did enable HTTPS on the "Kafka REST API" (via Knox, for example: https://knox.apache.org/books/knox-1-1-0/user-guide.html#Kafka), then you should also be enabling TLS/SSL on Kafka, in which case certificates would be needed to make external clients secure.

Kafka should realistically not be treated as a "walled off" service behind the Hadoop network, and you cannot proxy requests through another server without manually setting up that TLS tunnel yourself. Kafka is a common access point for getting data into Hadoop as well, so it should be treated as a first-class "edge ingestion layer" itself. You should take similar care to set up authentication and access rules around every single broker, just like you've done for the Hadoop "edge node".

You could alternatively use NiFi to listen on some other random port and forward to a Kafka producer processor; then someone scanning open ports wouldn't be able to detect that it's Kafka responding, it would be NiFi. You would still have the same problem, though: people can send random messages into that socket if it doesn't require authentication.
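To illustrate the TLS point above, here is a minimal sketch of broker-side SSL settings. The hostnames, ports, and keystore paths are placeholders, and on an HDP cluster you would normally set these through Ambari rather than appending to server.properties by hand:

```
# Sketch only: enable an SSL listener on a Kafka broker (all values are placeholders).
# On Ambari-managed clusters, make these changes in Ambari -> Kafka -> Configs instead.
cat >> /etc/kafka/conf/server.properties <<'EOF'
listeners=SSL://0.0.0.0:9093
advertised.listeners=SSL://broker1.example.com:9093
security.inter.broker.protocol=SSL
ssl.keystore.location=/etc/security/kafka/broker1.keystore.jks
ssl.keystore.password=changeit
ssl.key.password=changeit
ssl.truststore.location=/etc/security/kafka/truststore.jks
ssl.truststore.password=changeit
ssl.client.auth=required
EOF
```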
09-01-2018
01:27 AM
Nagios / OpsView / Sensu are popular options I've seen. StatsD / CollectD / MetricBeat are daemon metric collectors that run on each server (MetricBeat is somewhat tied to an Elasticsearch cluster, though). Prometheus is a popular option nowadays that would scrape metrics exposed by each local service. I have played around a bit with netdata, though I'm not sure if it can be applied to Hadoop monitoring use cases. DataDog is a vendor that offers lots of integrations such as Hadoop, YARN, Kafka, ZooKeeper, etc.

Realistically, you need some JMX + system monitoring tool, and a bunch exist.
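As a small illustration of the Prometheus approach, here is a sketch of a scrape config, assuming a JMX exporter (or similar agent) already exposes metrics on port 7071 of each host; the hostnames, port, and file path are all placeholders:

```
# Sketch: have Prometheus scrape JMX-exported metrics from a few Hadoop/Kafka hosts.
cat > /etc/prometheus/prometheus.yml <<'EOF'
global:
  scrape_interval: 30s
scrape_configs:
  - job_name: hadoop-and-kafka-jmx
    static_configs:
      - targets:
          - namenode1.example.com:7071
          - broker1.example.com:7071
          - broker2.example.com:7071
EOF
```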
09-01-2018
01:18 AM
1 Kudo
A Data Lake is not tied to a platform or technology, and Hadoop is not a requirement for a data lake either. IMO, a "data lake project" should not be a project description or the end goal; you can say you got your data from "source X", using "code Y", transformed and analyzed using "framework Z", but the combinations of tools out in the market that support such statements are so broad and vague that it really depends on what business use cases you are trying to solve.

For example, S3 is replaceable with HDFS or GCS or Azure Storage. Redshift is replaceable with Postgres (and you really should use Athena anyway if the data you want to query is in S3, where Athena is replaceable by PrestoDB), and those can be compared to Google BigQuery.

My suggestion would be not to tie yourself to a certain toolset, but if you are in AWS, their own documentation pages are very extensive. Since you are not asking a Hortonworks-specific question, I'm not sure what information you are looking for from this site.
08-24-2018
06:30 PM
You can enable JMX for metrics + Grafana for visualization, then Ambari Infra for log collection. However, you will not have visibility into consumer lag like Confluent Control Center offers, and you will need to find some external tool to do that for you, such as LinkedIn Burrow.

If you are not satisfied with that, Confluent Control Center can be added to an HDP cluster with manual setup: https://docs.confluent.io/current/control-center/docs/installation/install-apache-kafka.html You will need to copy the Confluent Metrics Reporter JARs from the Confluent Enterprise download over onto your HDP Kafka nodes under /usr/hdp/current/kafka.
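For the JMX part, a minimal sketch of exposing the broker's JMX port (the port number and paths are placeholders; on an Ambari-managed cluster you would set this in the kafka-env template instead of starting the broker by hand):

```
# Sketch: kafka-run-class.sh enables the JMX remote agent when JMX_PORT is set,
# so a collector (jmxtrans, Prometheus JMX exporter, etc.) can then read broker metrics.
export JMX_PORT=9999
/usr/hdp/current/kafka-broker/bin/kafka-server-start.sh /etc/kafka/conf/server.properties
```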