Support Questions

arsalan_siddiqi · ‎08-12-2017

Hi

I tried to install Apache Atlas on my laptop, but unfortunately I keep getting exceptions. Can anyone write a tutorial for getting Atlas running on a laptop with the embedded Solr and HBase settings, using the link:

Apache Atlas Installation

How do I configure it, and what needs to be installed beforehand (e.g. ZooKeeper)? I just need a basic installation with defaults.

It would be great to mention:

Checkout URL
Maven commands
Additional dependencies (software)
Configuration file locations and settings

Once you download the repo, there are multiple similar folders with the same content, and I am confused about which one to use and where to run scripts like atlas_start.py.

Thanks

arsalan_siddiqi · ‎08-24-2017

OK, here are all the steps required to run Apache Atlas natively with Berkeley DB and Elasticsearch:

Download and install Kafka using the link https://kafka.apache.org/downloads. Download the binary and extract it to your preferred location.
Kafka and Atlas also require ZooKeeper. Kafka ships with a ZooKeeper instance by default; if you do not have ZooKeeper installed or running, you can use that one. Start it with: kafkahome/bin/zookeeper-server-start.sh kafkahome/config/zookeeper.properties
Once ZooKeeper has started, you can check it with the command: netstat -ant | grep :2181. If everything is fine you should see: tcp6 0 0 :::2181 :::* LISTEN
Now start your Kafka server with the command: kafkahome/bin/kafka-server-start.sh kafkahome/config/server.properties
To check that Kafka is running, run the command netstat -ant | grep :9092. You should see a result similar to the one above.
Now you are ready to move on to Atlas. You can either use the link provided on the website or check out a branch or tag directly from GitHub. I used the command from their website: git clone https://git-wip-us.apache.org/repos/asf/atlas.git atlas
Navigate into the folder: cd atlas
Create a new folder called libext using: mkdir libext
Download the Berkeley DB zip file from this URL: http://download.oracle.com/otn/berkeley-db/je-5.0.73.zip
You will need an Oracle account; create one to download the zip file.
Copy this zip file into the libext folder that you just created.
Run the command: export MAVEN_OPTS="-Xmx1536m -XX:MaxPermSize=512m"
Run the command: mvn clean install -DskipTests (make sure to skip the tests)
Run the command: mvn clean package -DskipTests -Pdist,berkeley-elasticsearch
Navigate to the bin folder of the built distribution, which contains atlas_start.py: incubator-atlas/distro/target/apache-atlas-0.8-incubating-bin/apache-atlas-0.8-incubating/bin
OR
/home/arsalan/Development/atlas/distro/target/apache-atlas-0.9-SNAPSHOT-bin/apache-atlas-0.9-SNAPSHOT/bin
depending on which repo you have used.
Run the following command: python atlas_start.py

You can now navigate to localhost:21000 to check the Atlas GUI.
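The netstat checks in the steps above can also be scripted. Below is a minimal sketch, assuming all three services run on localhost with the default ports mentioned in this thread (2181, 9092, 21000). The names is_listening, SERVICES and report are made up for illustration and are not part of Atlas or Kafka.

```python
import socket

def is_listening(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and unreachable hosts.
        return False

# Default ports taken from the steps above (assumed unchanged).
SERVICES = {"ZooKeeper": 2181, "Kafka": 9092, "Atlas": 21000}

def report(host: str = "localhost") -> dict:
    """Map each service name to whether its port accepts connections."""
    return {name: is_listening(host, port) for name, port in SERVICES.items()}

if __name__ == "__main__":
    for name, up in report().items():
        print(f"{name}: {'listening' if up else 'not reachable'}")
```

A port that accepts connections only shows that something is listening there; it does not prove that the service behind it is healthy.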

Hope it helps!!!!!


anaik · ‎08-14-2017

If you run mvn clean install -DskipTests -Pdist,berkeley-elasticsearch (assuming you want to run with embedded dependencies) and navigate to distro/target/atlas-<version>-bin, you will be able to run Atlas on the local machine using the start script (atlas_start.py) under the bin folder.
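Since the exact folder name under distro/target depends on the version that was built, one way to avoid guessing is to search for the start script. This is only a sketch: find_start_scripts is a hypothetical helper, and it assumes the build has already completed and that the script is named atlas_start.py as described above.

```python
from pathlib import Path

def find_start_scripts(repo_root: str) -> list:
    """Return every atlas_start.py found under <repo_root>/distro/target."""
    target = Path(repo_root) / "distro" / "target"
    if not target.is_dir():
        # Nothing has been built yet, or repo_root is not the Atlas checkout.
        return []
    return sorted(target.rglob("atlas_start.py"))

if __name__ == "__main__":
    # Run from the root of the Atlas checkout.
    for script in find_start_scripts("."):
        print(script)
```

If more than one path is printed, the one inside the unpacked distribution's bin folder is the one referred to in this thread.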

If you want to run against HBase and Solr, then you need to have the following running before trying to start Atlas:

ZooKeeper
HBase
Kafka
Solr (cloud mode)

Having these running is the first step towards starting Atlas. Once these services are up and running, you'll have to make sure that the HBase servers, Kafka brokers and Solr servers have registered with ZooKeeper as well. The next step is to update atlas-application.properties with the correct addresses/URLs for the above services. Once that's done, Atlas should be able to start serving requests.
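As a rough illustration of the properties update, here is a sketch that rewrites key=value lines in the text of a properties file. The helper set_properties is hypothetical, and the key names and localhost addresses in EXTERNAL_SERVICES are assumptions recalled from the Atlas 0.8 configuration documentation, so check them against the atlas-application.properties file shipped with your build before relying on them.

```python
def set_properties(text: str, overrides: dict) -> str:
    """Return properties text with each key in overrides set to its value.

    Existing uncommented keys are replaced in place; missing keys are appended.
    """
    remaining = dict(overrides)
    out = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#") and "=" in stripped:
            key = stripped.split("=", 1)[0].strip()
            if key in remaining:
                out.append(f"{key}={remaining.pop(key)}")
                continue
        out.append(line)
    # Keys that were not present in the file are added at the end.
    out.extend(f"{key}={value}" for key, value in remaining.items())
    return "\n".join(out) + "\n"

# Assumed key names and single-node addresses; verify against your own file.
EXTERNAL_SERVICES = {
    "atlas.graph.storage.backend": "hbase",
    "atlas.graph.storage.hostname": "localhost",  # ZooKeeper quorum used by HBase
    "atlas.graph.index.search.backend": "solr5",
    "atlas.graph.index.search.solr.mode": "cloud",
    "atlas.graph.index.search.solr.zookeeper-url": "localhost:2181",
    "atlas.notification.embedded": "false",
    "atlas.kafka.zookeeper.connect": "localhost:2181",
    "atlas.kafka.bootstrap.servers": "localhost:9092",
}
```

Reading the existing file, passing its text through set_properties(text, EXTERNAL_SERVICES) and writing the result back keeps comments and unrelated settings untouched.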

PS: Running embedded mode is the fastest and easiest way to get Atlas up and running on a laptop/desktop.

Hope that helps. If it does, please upvote and accept the answer.

arsalan_siddiqi · ‎08-14-2017

@anaik

I was trying with the following command:

mvn clean package -DskipTests -Pdist,embedded-hbase-solr

In this case Solr and HBase are installed and started automatically when Atlas starts. I did not know that I would need to install Kafka separately. I do have it installed; I will run it and see whether any parameters need to be set in Atlas for Kafka.

arsalan_siddiqi · ‎08-24-2017

I was not able to run it with HBase and Solr, but the installation with Berkeley DB and Elasticsearch works by simply following the installation steps on their website.
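For reference, the two Maven invocations used in this thread can be wrapped in a small script so the flags are not mistyped. This is a sketch only: build_commands and run_build are hypothetical helpers, and it assumes mvn is on the PATH and that repo_dir is the Atlas checkout.

```python
import os
import subprocess

def build_commands(profiles=("dist", "berkeley-elasticsearch")):
    """Return the Maven command lines, with profiles joined into one -P flag."""
    return [
        ["mvn", "clean", "install", "-DskipTests"],
        ["mvn", "clean", "package", "-DskipTests", "-P" + ",".join(profiles)],
    ]

def run_build(repo_dir, maven_opts="-Xmx1536m -XX:MaxPermSize=512m"):
    """Run the build in repo_dir with MAVEN_OPTS set as in the steps."""
    env = dict(os.environ, MAVEN_OPTS=maven_opts)
    for cmd in build_commands():
        # check=True stops at the first failing Maven run.
        subprocess.run(cmd, cwd=repo_dir, env=env, check=True)
```

Passing ("dist", "embedded-hbase-solr") to build_commands yields the package command for the embedded HBase and Solr profile instead.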

anaik · ‎08-24-2017

Yeah, running against external HBase and Solr does require a lot of setup, which is mostly done by Ambari in any HDP deployment.

