Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

How to install and configure a minimum system of Apache Atlas?

avatar
Rising Star

I am a student and these questions make me frustrated several weeks, please help me.

According a developer in this community, running Atlas needs Hive, namely, the minimum system of Atlas must consist of Atlas, Hive, Hadoop and a RDBMS such as MySQL.

I also did much research, and read the tutorial on https://github.com/shivajid/atlas. But this tutorial is based on HDP, and my teacher requires me to configure the minimum system by myself.

I imitated this tutorial of shivajid/atlas mentioned above, and summary some main steps of this tutorial:

Step1: Download Hadoop and configure it.

Step2: Execute git clone to get the source code of Atlas, build it by maven, and configure it.

### After this step, I started Atlas server, in order to verify if the server is up and running, I used a web browser to visit "

### http://localhost:21000/api/atlas/admin/version", and it showed some version information. but when I tried to visit "

### http://localhost:21000" or "http://localhost:21000/dashboard", it showed HTTP ERROR 500. This is my first question.

Step3: Install MySQL-server, create MySQL user and grant privileges to him.

Step4: Get binary Hive and configure MySQL as the metadata repository of Hive.

Step5: Execute the script of "MySQLSourceSystem.dll" which comes from tutorial of shivajid/atlas on github.

Step6: Import data from "drivers.csv" and "timesheet.csv" to tables in MySQL.

Step7: Execute script quick_start.py which in ${atlas_dir}/bin.

### But it report that, no data was added to the Apache Atlas Server. This is my second question and I stop here now.

The following steps,which were explained in the tutorialI, haven't been done:

Step8: Execute script sqoop_job.sh

Step9: Validate tables in Hive

Step10: Check metadata in Atlas Web UI.

Summary:

I did much research to configure the minimun system of Atlas but failed, so I concluded my main steps shown above.

But these are still a lots of problems:

1. After started Atlas server (Step 2), why I had failed to visit "http://localhost:21000/dashboard" and got ERROR 500.

2. After execute quick_start.py (Step 7), why it report that no data was added to the Apache Atlas Server?

3. I think, the tutorial of shivajid/atlas on github is outdated which was post in Sep 2015, and it must use HDP sanbox. Could anyone give me or help me to write a brief tutorial of minimum system of Atlas which just conclude these component: Atlas, Hive, Hadoop, MySQL.

This is my Email: dreamcoding@hotmail.com, If anyone have some relative stuff or material, please send them to me.

Thank you very much. Thank you!

1 ACCEPTED SOLUTION

avatar
Guru

You mention that you start the Atlas server but did you shutdown Atlas before running the quick_start.py script? Perhaps there was a lock on the files which needed to be updated.

You may also need to verify that all variables and options are properly set. Here is a link to the install guide. http://atlas.incubator.apache.org/InstallationSteps.html

View solution in original post

2 REPLIES 2

avatar
Guru

You mention that you start the Atlas server but did you shutdown Atlas before running the quick_start.py script? Perhaps there was a lock on the files which needed to be updated.

You may also need to verify that all variables and options are properly set. Here is a link to the install guide. http://atlas.incubator.apache.org/InstallationSteps.html

avatar
Rising Star

Do you mean that I must shutdown Atlas server before running the quick_start.py script?

And there is another important question: where does quick_start.py script get data from? In the other words, where should I import my data before running this script?