Support Questions

Find answers, ask questions, and share your expertise

Is Kafka necessary to import metadada into Atlas?

avatar
Rising Star

I am learning Apache Atlas and read the demo on https://github.com/shivajid/atlas/blob/master/tutorial/Step1.md.

This tutorial shows that import metadata into Atlas without using Kafka.

So, Kafka is not necessary when we want to import metadata from Hive to Atlas, right?

And from the docs of http://atlas.apache.org/Architecture.html, I realize that there will have some failure of communication between Hook and Atlas because of network issues and hence inconsistent of metadata. The Kafka would avoid this inconsistency happen, right?

What other function or benefit if we use Kafka in the communication between Hook and Atlas?

Thank you very much.

1 ACCEPTED SOLUTION

avatar

Hello Ethan

No kafka is not necessary when importing data into atlas. Atals will actually listen in to services like, Hive,Sqoop, Falcon etc... to automatically import data. You can also interact with the Atlas APIs, rest or no, to import your own data, say tags for example.

Kafka is very useful for example in the communication with ranger for security policies. As you add tags to data in Atlas you want Ranger to pick them up as soon as possible and kafka is that gateway.

Kafka in the not so distant future will also be a service monitored by Atlas as it also is a gateway for data inside hadoop and as such is a source Atlas should do governance for.

hope this helps

View solution in original post

2 REPLIES 2

avatar

Hello Ethan

No kafka is not necessary when importing data into atlas. Atals will actually listen in to services like, Hive,Sqoop, Falcon etc... to automatically import data. You can also interact with the Atlas APIs, rest or no, to import your own data, say tags for example.

Kafka is very useful for example in the communication with ranger for security policies. As you add tags to data in Atlas you want Ranger to pick them up as soon as possible and kafka is that gateway.

Kafka in the not so distant future will also be a service monitored by Atlas as it also is a gateway for data inside hadoop and as such is a source Atlas should do governance for.

hope this helps

avatar
Rising Star

Thank you very much.