Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

How to install and configure a minimum system of Apache Atlas?

avatar
Rising Star

I am a student and these questions make me frustrated several weeks, please help me.

According a developer in this community, running Atlas needs Hive, namely, the minimum system of Atlas must consist of Atlas, Hive, Hadoop and a RDBMS such as MySQL.

I also did much research, and read the tutorial on https://github.com/shivajid/atlas. But this tutorial is based on HDP, and my teacher requires me to configure the minimum system by myself.

I imitated this tutorial of shivajid/atlas mentioned above, and summary some main steps of this tutorial:

Step1: Download Hadoop and configure it.

Step2: Execute git clone to get the source code of Atlas, build it by maven, and configure it.

### After this step, I started Atlas server, in order to verify if the server is up and running, I used a web browser to visit "

### http://localhost:21000/api/atlas/admin/version", and it showed some version information. but when I tried to visit "

### http://localhost:21000" or "http://localhost:21000/dashboard", it showed HTTP ERROR 500. This is my first question.

Step3: Install MySQL-server, create MySQL user and grant privileges to him.

Step4: Get binary Hive and configure MySQL as the metadata repository of Hive.

Step5: Execute the script of "MySQLSourceSystem.dll" which comes from tutorial of shivajid/atlas on github.

Step6: Import data from "drivers.csv" and "timesheet.csv" to tables in MySQL.

Step7: Execute script quick_start.py which in ${atlas_dir}/bin.

### But it report that, no data was added to the Apache Atlas Server. This is my second question and I stop here now.

The following steps,which were explained in the tutorialI, haven't been done:

Step8: Execute script sqoop_job.sh

Step9: Validate tables in Hive

Step10: Check metadata in Atlas Web UI.

Summary:

I did much research to configure the minimun system of Atlas but failed, so I concluded my main steps shown above.

But these are still a lots of problems:

1. After started Atlas server (Step 2), why I had failed to visit "http://localhost:21000/dashboard" and got ERROR 500.

2. After execute quick_start.py (Step 7), why it report that no data was added to the Apache Atlas Server?

3. I think, the tutorial of shivajid/atlas on github is outdated which was post in Sep 2015, and it must use HDP sanbox. Could anyone give me or help me to write a brief tutorial of minimum system of Atlas which just conclude these component: Atlas, Hive, Hadoop, MySQL.

This is my Email: dreamcoding@hotmail.com, If anyone have some relative stuff or material, please send them to me.

Thank you very much. Thank you!

1 ACCEPTED SOLUTION

avatar
Guru

You mention that you start the Atlas server but did you shutdown Atlas before running the quick_start.py script? Perhaps there was a lock on the files which needed to be updated.

You may also need to verify that all variables and options are properly set. Here is a link to the install guide. http://atlas.incubator.apache.org/InstallationSteps.html

View solution in original post

2 REPLIES 2

avatar
Guru

You mention that you start the Atlas server but did you shutdown Atlas before running the quick_start.py script? Perhaps there was a lock on the files which needed to be updated.

You may also need to verify that all variables and options are properly set. Here is a link to the install guide. http://atlas.incubator.apache.org/InstallationSteps.html

avatar
Rising Star

Do you mean that I must shutdown Atlas server before running the quick_start.py script?

And there is another important question: where does quick_start.py script get data from? In the other words, where should I import my data before running this script?