Created on 11-30-2015 07:26 AM - edited 09-16-2022 01:33 AM
Background:
The Zeppelin Ambari service has been updated and now supports installing latest Zeppelin version (0.5.5) on HDP using blueprints to automate creation of 'data science' cluster. SequenceIQ team have a datascientist blueprint that installs Zeppelin, but based on my conversations with @lpapp, its based on an older version of the Ambari service so does not install the latest or support as many options.
Below is a writeup of how to deploy Zeppelin via blueprints. Note that if you already have a cluster running, you should just use the Add service wizard in Ambari to deploy Zeppelin using the steps on the github
Pre-reqs:
Bring up 4 VMs imaged with RHEL/CentOS 6.5 or later (e.g. called node1-4 in this case).
Note that the VMs should not already have HDP related software installed on them at this point.
export ambari_server=node1 curl -sSL https://raw.githubusercontent.com/seanorama/ambari-bootstrap/master/ambari-bootstrap.sh | sudo -E sh
export install_ambari_server=true curl -sSL https://raw.githubusercontent.com/seanorama/ambari-bootstrap/master/ambari-bootstrap.sh | sudo -E sh yum install -y git git clone https://github.com/hortonworks-gallery/ambari-zeppelin-service.git /var/lib/ambari-server/resources/stacks/HDP/2.3/services/ZEPPELIN
/var/lib/ambari-server/resources/stacks/HDP/2.3/role_command_order.json
file to include below:"ZEPPELIN_MASTER-START": ["NAMENODE-START", "DATANODE-START"],
service ambari-server restart service ambari-agent restart
curl -u admin:admin -H X-Requested-By:ambari http://localhost:8080/api/v1/hosts service ambari-agent status
yum install -y python-argparse git clone https://github.com/seanorama/ambari-bootstrap.git #optional - limit the services for faster deployment #for minimal services export ambari_services="HDFS MAPREDUCE2 YARN ZOOKEEPER HIVE ZEPPELIN" #for most services #export ambari_services="ACCUMULO FALCON FLUME HBASE HDFS HIVE KAFKA KNOX MAHOUT OOZIE PIG SLIDER SPARK SQOOP MAPREDUCE2 STORM TEZ YARN ZOOKEEPER ZEPPELIN" export deploy=false cd ambari-bootstrap/deploy bash ./deploy-recommended-cluster.bash cd tmpdir* #edit the blueprint to customize as needed. You can use sample blueprints provided below to see how to add the custom services. vi blueprint.json #edit cluster file if needed vi cluster.json
#Pick one of the below blueprints #for minimal services download this one wget https://raw.githubusercontent.com/hortonworks-gallery/ambari-zeppelin-service/master/blueprint-4node... -O blueprint-zeppelin.json #for most services download this one wget https://raw.githubusercontent.com/hortonworks-gallery/ambari-zeppelin-service/master/blueprint-4node... -O blueprint-zeppelin.json
#Pick one of the below blueprints #for minimal services download this one wget https://raw.githubusercontent.com/hortonworks-gallery/ambari-zeppelin-service/master/blueprint-1node... -O blueprint-zeppelin.json
vi blueprint-zeppelin.json
"zeppelin.host.publicname":"<public IP or hostname of zeppelin node>"
so the Zeppelin Ambari view is pointing to external hostname (instead of the internal name, which is the default)#upload the blueprint to Ambari curl -u admin:admin -H X-Requested-By:ambari http://localhost:8080/api/v1/blueprints/zeppelinBP -d @blueprint-zeppelin.json
#for 4 node setup wget https://raw.githubusercontent.com/hortonworks-gallery/ambari-zeppelin-service/master/cluster-4node.j... -O cluster.json #for single node setup wget https://raw.githubusercontent.com/hortonworks-gallery/ambari-zeppelin-service/master/cluster-1node.j... -O cluster.json
vi cluster.json
curl -u admin:admin -H X-Requested-By:ambari http://localhost:8080/api/v1/clusters/zeppelinCluster -d @cluster.json
Created on 04-21-2016 12:43 PM
@Ali Bajwa A simplified approach: On the Ambari Server:
yum -y install git git clone https://github.com/seanorama/ambari-bootstrap cd ambari-bootstrap export ambari_server_custom_script=${ambari_server_custom_script:-~/ambari-bootstrap/ambari-extras.sh} export install_ambari_server=true ./ambari-bootstrap.shThen deploy the cluster. The "extras" script above takes care of all the tedious stuff automatically (cloning Zeppelin, the blueprint defaults, the role command order, ...).
yum -y install python-argparse cd deploy export ambari_services="HDFS MAPREDUCE2 YARN ZOOKEEPER HIVE SPARK ZEPPELIN" bash ./deploy-recommended-cluster.bash