Created on 01-19-2016 10:40 PM - edited 09-16-2022 02:58 AM
Since Hadoop is not typical enterprise software, we are having trouble getting the QA team to understand how it fits into our application landscape. They would like us to have three separate environments for Dev, QA and Production. Do you typically see this, or do you have any best practice documentation that we could provide to them?
Created 01-20-2016 12:39 PM
@Ancil McBarnett I see this pattern with all my customers. Dev tends to be small. Sometimes dev consists only of sandbox instances, and it is almost always a virtual environment. Test mimics prod in all configuration aspects but tends to be about 30%-50% of prod capacity.
Upgrades, configuration changes, patching and tech previews all occur in the test environment prior to any production rollout. In the end, Hadoop isn't much different from other platforms as far as this is concerned.
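To make the 30%-50% rule of thumb concrete, here is a minimal sizing sketch. The function name `qa_cluster_range` and the 20-node production cluster are hypothetical, chosen only for illustration; real sizing also depends on replication, workload and storage needs.

```python
def qa_cluster_range(prod_nodes, low_pct=30, high_pct=50):
    """Return (min_nodes, max_nodes) for a test cluster sized at
    low_pct..high_pct of production, rounding up to whole nodes.

    Integer arithmetic is used so the rounding is exact.
    """
    def ceil_pct(pct):
        # Ceiling of prod_nodes * pct / 100 without floating point.
        return (prod_nodes * pct + 99) // 100

    return ceil_pct(low_pct), ceil_pct(high_pct)


# Hypothetical example: a 20-node production cluster.
print(qa_cluster_range(20))  # (6, 10)
```

So for a 20-node production cluster, a test cluster of roughly 6 to 10 nodes would fall in that range.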
Created 01-19-2016 11:10 PM
Yes to three environments.
Dev and QA do not need to be as big as prod.
DR is required too, and we can use the DR cluster for reporting.
Created 01-20-2016 06:34 PM
I believe DEV -> QA -> PROD is the minimum, and I have seen larger organizations deploy LAB -> DEV -> QA -> PROD -> DR as separate clusters.
Created 02-02-2016 04:27 PM
@Ancil McBarnett please accept best answer
Created 10-26-2016 10:35 PM
Could you elaborate further on how the DR cluster can be used for reporting?
Many thanks
Created 02-13-2017 09:49 AM
Having four environments (development, testing, pre-production/staging and production) is good practice in a big company, because staging lets us make sure everything works properly before it reaches production. Of course, the dev, testing and staging environments are smaller than the planned production environment. For instance, if dev, testing and staging each have 2 nodes, production might have about 8 nodes. Again, it always depends on replication, traffic and other relevant factors. Thanks!