Created 09-26-2016 12:01 PM
I am trying to build a system of big-data platform ,with the tools hadoop, hive, hbase, pig, hive, storm, zookeeper. As per my knowledge we can either install it using tarball file given on apache website for each tools or we can use HDP repositories and ambari to build the things.( Am not looking for Cloudera ). As mentioned their respective document both come with Apache Licence 2.0. .
Created 09-26-2016 04:30 PM
1) HDP is a stack that is maintained by Hortonworks. It is a collection of services and versions of the services certified by Hortonworks to work together as a hadoop system. With a version of HDP "stack", you will have a recommended set of versions of services installed.
You can see the growth of the HDP stack in the diagram titled "Ongoing innovation in Apache", here :
http://hortonworks.com/products/data-center/hdp/
2) To use HDP repo you don't need an enterprise license. HDP is completely open source
3) Before starting off things in a production system, you may want to check install using sandbox and get familiar with HDP:
http://hortonworks.com/hadoop-tutorial/learning-the-ropes-of-the-hortonworks-sandbox/
and then go ahead and look at :
To get a starting point into HDP docs, look at :
http://hortonworks.com/downloads/#data-platform
and
http://docs.hortonworks.com/index.html -- This has docs for every version of HDP and ambari
Created 09-26-2016 04:30 PM
1) HDP is a stack that is maintained by Hortonworks. It is a collection of services and versions of the services certified by Hortonworks to work together as a hadoop system. With a version of HDP "stack", you will have a recommended set of versions of services installed.
You can see the growth of the HDP stack in the diagram titled "Ongoing innovation in Apache", here :
http://hortonworks.com/products/data-center/hdp/
2) To use HDP repo you don't need an enterprise license. HDP is completely open source
3) Before starting off things in a production system, you may want to check install using sandbox and get familiar with HDP:
http://hortonworks.com/hadoop-tutorial/learning-the-ropes-of-the-hortonworks-sandbox/
and then go ahead and look at :
To get a starting point into HDP docs, look at :
http://hortonworks.com/downloads/#data-platform
and
http://docs.hortonworks.com/index.html -- This has docs for every version of HDP and ambari
Created 09-26-2016 05:43 PM
@sbhat is correct. I would like to add to her response that HDP stack is 100% open source based on Apache. It is a tested platform as such tools from the ecosystem can work together and deliver enterprise level quality. Taking the tools from Apache does not assure that they will work smoothly together.
There is no concept of license associated with HDP. You can use the distribution as-is, however, enterprises elect to purchase paid support as such that can receive 24x7 support and get the chance to influence the roadmap or receive special attention on critical issues. Hortonworks engineers are actively involved in Hadoop ecosystem tools development and they can help with addressing bugs or including features that the community would like to have added.
Best for you would be to start with downloading the sandbox as @sbhat suggested.
Good luck!