Apache Atlas on Amazon EMR - Hive metadata

Hello

I'm trying to configure Apache Atlas on Amazon EMR. I installed Apache Atlas 1.0.0 on Amazon EMR 5.27.0, using the template from this article as a reference: https://aws.amazon.com/pt/blogs/big-data/metadata-classification-lineage-and-discovery-using-apache-....

To persist Atlas data outside the EMR cluster, so that it is not lost when the cluster shuts down, I enabled Amazon S3 storage mode for HBase on EMR (https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-hbase-s3.html). I also configured Hive to use the AWS Glue Data Catalog as its metastore.

After I imported the Hive metadata into Atlas by running the "import-hive.sh" script, the metadata was listed in the Atlas interface. However, after I shut down the EMR cluster and recreated it with the same settings, the Hive metadata could no longer be found through the Atlas basic search interface, only through the advanced search interface. Even after I ran the "import-hive.sh" script again, the Hive metadata is still not listed in Atlas basic search. Could anybody help me?
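To confirm the symptom outside the UI, the two search paths can be queried directly through the Atlas REST API and their entity counts compared. This is a minimal sketch, not a verified diagnostic: it assumes Atlas is reachable on its default port 21000, that basic and advanced (DSL) search are served at /api/atlas/v2/search/basic and /api/atlas/v2/search/dsl, that hive_table is the type of interest, and that basic authentication is in use. The helper names and the host and credentials in the usage note are hypothetical.

```python
import base64
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def atlas_search_urls(base_url, type_name="hive_table", limit=10):
    """Build the basic-search and DSL-search URLs for one entity type."""
    base = base_url.rstrip("/") + "/api/atlas/v2/search"
    basic = base + "/basic?" + urlencode({"typeName": type_name, "limit": limit})
    dsl = base + "/dsl?" + urlencode({"query": type_name, "limit": limit})
    return {"basic": basic, "dsl": dsl}


def entity_count(search_result):
    """Count entities in a search response; a missing or null list counts as 0."""
    return len(search_result.get("entities") or [])


def fetch_json(url, user, password):
    """GET a URL with HTTP basic auth and decode the JSON body."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request = Request(url, headers={"Authorization": "Basic " + token})
    with urlopen(request) as response:
        return json.load(response)


def compare_searches(base_url, user, password, type_name="hive_table"):
    """Return the entity count reported by each search path."""
    urls = atlas_search_urls(base_url, type_name)
    return {name: entity_count(fetch_json(url, user, password))
            for name, url in urls.items()}
```

For example, `compare_searches("http://<master-node>:21000", "<user>", "<password>")` returns a dictionary with one count per search path. A zero for "basic" next to a non-zero count for "dsl" would reproduce what the UI shows.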

Thank you.


Re: Apache Atlas on Amazon EMR - Hive metadata

@deboras While we welcome your question, it is much better suited to the appropriate AWS forum for EMR.

Bill Brooks, Community Manager