Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Druid installation

Druid installation

New Contributor

Hi,

To install below Druid components, I need to choose a server for each of these (Master/datanode).

1.coordinator

2.superset

3.router

4.overlord

5.broker


We have 3 master nodes on our cluster. Can anyone help me in understanding which component has to be installed on what node (data node/Master node)

Thanks.

2 REPLIES 2

Re: Druid installation

Rising Star

Please refer below answer from https://community.hortonworks.com/questions/140030/druid-installation.html

First, yes you can co-locate all those service together. Second in order to get high availability you need to have at least 2 different physical nodes running all the services. Thus you will get HA with a replication of 2. Or you can choose an other combination of collocation where each service is run at least over 2 different nodes.

Although ideally you want to have something like this. Node1 Broker Node2 Broker Node3 Router/Overlord/Coordinator/Superser Node4 Router/Overloard/Coordinator/Superset The reason what you need broker to be alone is the fact that broker usually needs way more memory than all the other together therefore you might have special hardware for that. But to keep it simple you can start with collocate all the services X 2 and make sure that broker is not running with another service that needs RAM as well.

Highlighted

Re: Druid installation

New Contributor

@Sowmya K

The components of druid, coordinator, router, overlord and broker, are working closely together. Each of them has it's own task. You can find an overview here. Mainly they are coordinating the druid jobs. So I recommend that you put them together on a master node. Superset is an extra component. You can put it on another master node or on the same that does not matter.

Durid also need's Druid Historical and Druid MiddleManager components. This you should install on the Data Nodes.

Here is an answer to a similiar questions like your's with some more informations.

https://community.hortonworks.com/questions/140030/druid-installation.html

If you are intrested in an architecture overview of druid i can advise slide five of this presentation.

https://www.slideshare.net/Hadoop_Summit/interactive-analytics-at-scale-in-apache-hive-using-druid-8...

I hope i could help you.

Regards,

Michael

Don't have an account?
Coming from Hortonworks? Activate your account here