Member since
04-26-2016
6
Posts
3
Kudos Received
0
Solutions
05-19-2016
05:11 PM
2 Kudos
Hi, From your experience, what are the best practices for the following environments (development, testing, pre-production, production, data lab) in term of: High availability of master nodes High availability of edge nodes (knox, clients, etc) Security (kerberos, knox, ranger, etc) Master and slave node mixing % global of data to store Etc Thanks
... View more
05-18-2016
10:27 PM
Hi I'lI install an HDP cluster and I'm looking for information on how to partition my disks. I saw the doc but it's really light on the subject documentation Is there a more in depth doc ? Do you have general recommendations and best practices for disks partitioning ? For file systems ? Thanks
... View more
04-27-2016
01:40 AM
Thanks for these information. Right now I am working on the architectural level and these details will be useful for further steps
... View more
04-26-2016
03:36 PM
1 Kudo
Hi What are my options for HDFS replication in a DR scenario ? What are the pros and cons of each option ? Thanks
... View more
04-26-2016
01:18 AM
Hi I am building a data lake with hdp where kafka will be used to ingest all the data. I have two options. One cluster for everything and kafka is deployed exclusively on some node. One hdp cluster with storage and proceesing and another cluster with only kafka. What's the best approach ? Pros and cons ? How to size my kafka part ?
... View more