Created 05-02-2016 01:44 AM
I am a student, and these questions have frustrated me for several weeks. Please help me.
According to a developer in this community, running Atlas requires Hive; that is, the minimum Atlas system must consist of Atlas, Hive, Hadoop, and an RDBMS such as MySQL.
I have also done a lot of research and read the tutorial at https://github.com/shivajid/atlas. But that tutorial is based on HDP, and my teacher requires me to configure the minimum system myself.
I followed the shivajid/atlas tutorial mentioned above. Here is a summary of its main steps:
Step1: Download Hadoop and configure it.
Step2: Execute git clone to get the Atlas source code, build it with Maven, and configure it.
### After this step, I started the Atlas server. To verify that it was up and running, I opened "http://localhost:21000/api/atlas/admin/version" in a web browser, and it showed some version information. But when I tried to visit "http://localhost:21000" or "http://localhost:21000/dashboard", it showed HTTP ERROR 500. This is my first question.
Step3: Install MySQL server, create a MySQL user, and grant privileges to that user.
Step4: Get the Hive binary distribution and configure MySQL as Hive's metadata repository.
Step5: Execute the "MySQLSourceSystem.dll" script, which comes from the shivajid/atlas tutorial on GitHub.
Step6: Import data from "drivers.csv" and "timesheet.csv" into tables in MySQL.
Step7: Execute the script quick_start.py, which is in ${atlas_dir}/bin.
### But it reported that no data was added to the Apache Atlas Server. This is my second question, and this is where I am stuck now.
The following steps, which are explained in the tutorial, have not been done yet:
Step8: Execute script sqoop_job.sh
Step9: Validate tables in Hive
Step10: Check metadata in Atlas Web UI.
Summary:
I did a lot of research to configure the minimum Atlas system but failed, so I have summarized my main steps above.
But there are still a lot of problems:
1. After starting the Atlas server (Step 2), why did I fail to visit "http://localhost:21000/dashboard" and get ERROR 500?
2. After executing quick_start.py (Step 7), why did it report that no data was added to the Apache Atlas Server?
3. I think the shivajid/atlas tutorial on GitHub, which was posted in Sep 2015, is outdated, and it requires the HDP sandbox. Could anyone give me, or help me write, a brief tutorial for a minimum Atlas system that includes just these components: Atlas, Hive, Hadoop, and MySQL?
This is my email: dreamcoding@hotmail.com. If anyone has any related material, please send it to me.
Thank you very much. Thank you!
Created 05-02-2016 09:14 PM
You mention that you started the Atlas server, but did you shut down Atlas before running the quick_start.py script? Perhaps there was a lock on the files that needed to be updated.
You may also need to verify that all variables and options are properly set. Here is a link to the install guide: http://atlas.incubator.apache.org/InstallationSteps.html
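To narrow down the first question, it may help to check the endpoints from a script instead of the browser, so you can see the exact status code each one returns. Below is a minimal sketch in Python 3 using only the standard library. The base URL and paths are the ones quoted in the question; the helper names `http_status` and `check_endpoints` are made up for this illustration and are not part of Atlas.

```python
import urllib.error
import urllib.request

# Base URL and paths are the ones quoted in the question above.
BASE_URL = "http://localhost:21000"
PATHS = ["/api/atlas/admin/version", "/", "/dashboard"]


def http_status(url, timeout=5):
    """Return the HTTP status code for url, or None if no connection could be made."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses raise HTTPError; the status code is still available.
        return err.code
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or DNS failure: the server is not reachable.
        return None


def check_endpoints(base_url=BASE_URL, paths=PATHS, fetch=http_status):
    """Map each path to the status code returned by fetch (hypothetical helper)."""
    return {path: fetch(base_url.rstrip("/") + path) for path in paths}


if __name__ == "__main__":
    for path, status in check_endpoints().items():
        print(path, "->", status if status is not None else "no response")
```

If the version path returns 200 while "/dashboard" returns 500, the server process is probably running and the failure is more likely in the web UI part, in which case the Atlas server log would be the next place to look. If every path shows "no response", the server is probably not running or is listening on a different port.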
Created 05-03-2016 06:43 AM
Do you mean that I must shut down the Atlas server before running the quick_start.py script?
And there is another important question: where does the quick_start.py script get its data from? In other words, where should I import my data before running this script?