<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Where does Kafka have to be installed? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171704#M50122</link>
    <description>&lt;P&gt;&lt;STRONG&gt;Setup:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;To get maximum fault tolerance and performance, install Kafka on each node (source server), so you end up with 3 Kafka nodes. You will also need ZooKeeper, which you can likewise install on all 3 nodes.&lt;/P&gt;&lt;P&gt;The above setup is recommended if this is a PoC. For production use, however, it is recommended to run Kafka and ZooKeeper on nodes other than your source nodes to provide fault tolerance. Additionally, Kafka relies heavily on the OS page cache, which may interfere with the applications running on the 3 source nodes.&lt;/P&gt;&lt;P&gt;Just to clarify: Kafka shouldn't be confused with Flume. It is a message broker service; you are responsible for ingesting data into it (e.g. with Flume) and for reading data out of it (e.g. with Storm or Spark).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Scaling:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Scaling Kafka is a 3-step operation; step 3 is optional but recommended:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Add nodes to the cluster (use Ambari to add nodes)&lt;/LI&gt;&lt;LI&gt;Alter the topic and add partitions (1 partition per node)&lt;/LI&gt;&lt;LI&gt;(Optional) Rebalance Kafka so that existing partitions are spread across the new nodes&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;
On a side note: Kafka works best in clustered mode. You can run single-node Kafka, but it fundamentally defeats the purpose of Kafka (partitioning and fault tolerance).&lt;/P&gt;</description>
    <pubDate>Fri, 30 Dec 2016 02:45:25 GMT</pubDate>
    <dc:creator>ambud_sharma1</dc:creator>
    <dc:date>2016-12-30T02:45:25Z</dc:date>
    <item>
      <title>Where does Kafka have to be installed?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171701#M50119</link>
      <description>&lt;P&gt;If I have three servers as log sources and one server where the logs are collected, where should I install Kafka?&lt;/P&gt;&lt;P&gt;- Do I have to install it on each server, or&lt;/P&gt;&lt;P&gt;- do I need to add one more server to run Kafka between the source servers and the server where the data is collected?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Mon, 26 Dec 2016 15:41:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171701#M50119</guid>
      <dc:creator>anggriawanrezab</dc:creator>
      <dc:date>2016-12-26T15:41:38Z</dc:date>
    </item>
    <item>
      <title>Re: Where does Kafka have to be installed?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171702#M50120</link>
      <description>&lt;P&gt;@&lt;A href="https://community.hortonworks.com/users/15069/anggriawanrezabramantya.html"&gt;Bramantya Anggriawan&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Ideally you should install the Kafka broker on the nodes where the logs should be collected. The topic partition is the unit of parallelism in Kafka: on both the producer and the broker side, writes to different partitions can be done fully in parallel. If you don't have many topics to produce data to, you can get by with 1-2 Kafka servers.&lt;/P&gt;</description>
      <pubDate>Mon, 26 Dec 2016 15:45:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171702#M50120</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-12-26T15:45:58Z</dc:date>
    </item>
    <item>
      <title>Re: Where does Kafka have to be installed?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171703#M50121</link>
      <description>&lt;P&gt;So I only have to install Kafka on the one server where the logs are collected. What if I want to add more servers to the Kafka cluster?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Dec 2016 16:50:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171703#M50121</guid>
      <dc:creator>anggriawanrezab</dc:creator>
      <dc:date>2016-12-27T16:50:49Z</dc:date>
    </item>
    <item>
      <title>Re: Where does Kafka have to be installed?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171704#M50122</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Setup:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;To get maximum fault tolerance and performance, install Kafka on each node (source server), so you end up with 3 Kafka nodes. You will also need ZooKeeper, which you can likewise install on all 3 nodes.&lt;/P&gt;&lt;P&gt;The above setup is recommended if this is a PoC. For production use, however, it is recommended to run Kafka and ZooKeeper on nodes other than your source nodes to provide fault tolerance. Additionally, Kafka relies heavily on the OS page cache, which may interfere with the applications running on the 3 source nodes.&lt;/P&gt;&lt;P&gt;Just to clarify: Kafka shouldn't be confused with Flume. It is a message broker service; you are responsible for ingesting data into it (e.g. with Flume) and for reading data out of it (e.g. with Storm or Spark).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Scaling:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Scaling Kafka is a 3-step operation; step 3 is optional but recommended:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Add nodes to the cluster (use Ambari to add nodes)&lt;/LI&gt;&lt;LI&gt;Alter the topic and add partitions (1 partition per node)&lt;/LI&gt;&lt;LI&gt;(Optional) Rebalance Kafka so that existing partitions are spread across the new nodes&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;
On a side note: Kafka works best in clustered mode. You can run single-node Kafka, but it fundamentally defeats the purpose of Kafka (partitioning and fault tolerance).&lt;/P&gt;</description>
      <pubDate>Fri, 30 Dec 2016 02:45:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Where-is-Kafka-have-to-be-installed/m-p/171704#M50122</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2016-12-30T02:45:25Z</dc:date>
    </item>
  </channel>
</rss>

