Community Articles

Find and share helpful community-sourced technical articles.
Celebrating as our community reaches 100,000 members! Thank you!
Super Collaborator

There are many ways to run Hadoop on virtual machines. Earlier this year I tried several approaches, and ended up using a helpful Quick Start Guide written by Yusaku Sako. The Quick Start uses VirtualBox, Vagrant, and predefined scripts to set up a multi-node HDP cluster. You can choose which version of Ambari to install, and then choose and install an associated version of the HDP stack.

For anyone new to virtual machines, there is now a Quick Start for New VM Users. The extended version adds background information and additional details for installing Ambari and the HDP stack. Topics include:

  • Terminology
  • Prerequisites
  • Installing VirtualBox and Vagrant
  • Starting Linux Virtual Machines
  • Accessing Virtual Machines
  • Installing Ambari
  • Installing the HDP Stack
  • Troubleshooting
  • Reference information for basic Vagrant commands