Created on
12-15-2019
04:51 PM
- last edited on
12-15-2019
06:06 PM
by
ask_bill_brooks
Hello,
I need to create a distributed system with 2 clusters (many nodes in each) cluster, witch solution is required ? and how can i do it knowing that i am a beginner with big data technologies ?
I need a tool to manage these clusters ( ex use HDFS with 2 clusters ).
every information can be very helpful.
Thank you.
Created 12-15-2019 07:44 PM
Hi @Hakim
For bigdata technology you can adopt Hadoop platform for build distributed cluster.
You can follow link below which will guide you to setup distributed cluster irrespective of number of nodes -
https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/installation.html
Note: You can test for trail subscription which has limited no of service features.
Also Cloudera Manager is Tool which allows you to manage this cluster.
Created 12-15-2019 08:52 PM
Hello,
thank you @sagarshimpi.
I am using HDP end Cloudera but i don't know how can use them for 2 clusters for example.
i need to use HDFS with 2 clusters , every cluster has 3 datanodes.
do you have an idea how can i handle this ?
Created 12-15-2019 10:51 PM
Hi @Hakim
To what i know you cannot use entire hdfs across 2 clusters.
If you want you can use HDFS NFS gateway which will allow you to mount hdfs filesystem as mount point on remote OS.