- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Can anyone make a step by step tutorial for Installing Apache Atlas .8 or .9 on you laptop
- Labels:
-
Apache Atlas
Created ‎08-12-2017 09:51 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi
I tried to install apache atlas on my laptop unfortunately I keep on getting exceptions. Can any one make a tutorial to get atlas running on your laptop with embeded Solr and Hbase setting using the link:
How to configure it and what needs to be installed before hand (zookeeper). I just need a basic installation with defaults.
It would be great to mention
- Checkout URL
- maven commands
- additional dependencies (softwares)
- configurations file locations and settings
Once you download the repo there are multiple similar folders with the same content which I am confused about, regarding which one to use and where to run the scripts like start_atlas.py etc
Thanks
Created ‎08-24-2017 06:12 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Ok here are all the steps required to run Apache Atlas natively with Berkeley DB and Elastic:
- Download and install Kafka use the link : https://kafka.apache.org/downloads. Download the binary and extract to your required location.
- Kafka and Atlas would also require Zookeeper. By default kafka comes with an instance of zookeeper. If you do not have zookeeper running or installed, you can use this. Navigate to and run : kafkahome/bin/zookeeper-server-start.sh
- Once zookeeper has started you can check it using the command: netstat -ant | grep :2181. if everything is fine you should see: tcp6 0 0 :::2181 :::* LISTEN
- Now you can start your kafka server using the command: ./kafkaHOME/bin/kafka-server-start.sh /KafkaHome/config/server.properties
- To check if kafka is running run the command netstat -ant | grep :9092. You should see a similar result as mentioned above.
- Now you are ready to move on with ATLAS. You can either use the link provided on the website or do a branch and tag checkout directly from github. I used the command on their website: git clone https://git-wip-us.apache.org/repos/asf/atlas.git atlas
- navigate into the folder : cd atlas
- Create new folder called libext using: mkdir libext
- You need to download the jar file form this URL. http://download.oracle.com/otn/berkeley-db/je-5.0.73.zip
- You will need an oracle account. Create one to download the zip file.
- Copy this zip file into your libext folder that you just created.
- run command export MAVEN_OPTS="-Xmx1536m -XX:MaxPermSize=512m"
- run command mvn clean install -DskipTests (MAKE SURE TO USE SKIP TESTS )
- run command: mvn clean package -DskipTests -Pdist,berkeley-elasticsearch
- Navigate to the following location: incubator-atlas/distro/target/apache-atlas-0.8-incubating-bin/apache-atlas-0.8-incubating/bin/atlas_start.py
OR
/home/arsalan/Development/atlas/distro/target/apache-atlas-0.9-SNAPSHOT-bin/apache-atlas-0.9-SNAPSHOT
Depending on which repo you have used. - Run the follwoing command python atlas_start.py
You can now navigate to localhost:21000 to check Atlas GUI.
Hope it helps!!!!!
Created ‎08-14-2017 05:23 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
If you do a mvn clean install -DskipTests -Pdist,berkeley-elasticsearch (assuming you want to run embedded dependencies) and navigate to distro/target/atlas-<version>-bin, you will be able to run atlas on the local machine using the start script (atlas_start.py) under the bin folder.
If you're want to run against hbase and solr then you would need to have the following running before trying to start Atlas
- Zookeeper
- Hbase
- Kafka
- Solr (cloud mode)
Having these running is the first step towards starting Atlas. Once these services are up and running, you'll have to make sure that Zookeeper has registered the HBase servers, Kafka brokers and Solr servers as well. The next step is to update the atlas-application.properties with the correct address/ URLs for the above services. Once that's done then Atlas should be able to start serving requests.
PS: Running embedded mode is the fastest and easiest way to get Atlas up and running on a laptop/desktop.
Hope that helps, if it does please upvote and accept the answer.
Created ‎08-14-2017 06:05 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I was trying with the following command:
mvn clean package _DskipTests-Pdist,embedded-hbase-solr
In this case Solr and Hbase are installed and started automatically when atlas starts. I did not know that I would need to install kafka separately. I do have it installed, I will run it and see if any parameter need to be set in atlas for kafka.
Created ‎08-24-2017 05:28 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I was not able to run it with hbase and solr. But the installation with berkely db and elastic works by simply following the installation steps on their website
Created ‎08-24-2017 05:30 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yeah running against external hbase and solr does require a lot of setup which is mostly done by Ambari in any HDP deployment.
Created ‎08-24-2017 06:12 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Ok here are all the steps required to run Apache Atlas natively with Berkeley DB and Elastic:
- Download and install Kafka use the link : https://kafka.apache.org/downloads. Download the binary and extract to your required location.
- Kafka and Atlas would also require Zookeeper. By default kafka comes with an instance of zookeeper. If you do not have zookeeper running or installed, you can use this. Navigate to and run : kafkahome/bin/zookeeper-server-start.sh
- Once zookeeper has started you can check it using the command: netstat -ant | grep :2181. if everything is fine you should see: tcp6 0 0 :::2181 :::* LISTEN
- Now you can start your kafka server using the command: ./kafkaHOME/bin/kafka-server-start.sh /KafkaHome/config/server.properties
- To check if kafka is running run the command netstat -ant | grep :9092. You should see a similar result as mentioned above.
- Now you are ready to move on with ATLAS. You can either use the link provided on the website or do a branch and tag checkout directly from github. I used the command on their website: git clone https://git-wip-us.apache.org/repos/asf/atlas.git atlas
- navigate into the folder : cd atlas
- Create new folder called libext using: mkdir libext
- You need to download the jar file form this URL. http://download.oracle.com/otn/berkeley-db/je-5.0.73.zip
- You will need an oracle account. Create one to download the zip file.
- Copy this zip file into your libext folder that you just created.
- run command export MAVEN_OPTS="-Xmx1536m -XX:MaxPermSize=512m"
- run command mvn clean install -DskipTests (MAKE SURE TO USE SKIP TESTS )
- run command: mvn clean package -DskipTests -Pdist,berkeley-elasticsearch
- Navigate to the following location: incubator-atlas/distro/target/apache-atlas-0.8-incubating-bin/apache-atlas-0.8-incubating/bin/atlas_start.py
OR
/home/arsalan/Development/atlas/distro/target/apache-atlas-0.9-SNAPSHOT-bin/apache-atlas-0.9-SNAPSHOT
Depending on which repo you have used. - Run the follwoing command python atlas_start.py
You can now navigate to localhost:21000 to check Atlas GUI.
Hope it helps!!!!!
