Created 05-22-2017 10:00 AM
Hi,
Am trying to implement data lineage for my spark application. I Have kafka topic, spark streaming read data from kafka and place in data source. when I checked apache atlas it does n't provide any hooks for spark. I guess we have to use rest api for this implementation. can someone point to some documentation or example for this?
Created 05-22-2017 01:38 PM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 05-22-2017 01:38 PM
Want to get a detailed solution you have to login/registered on the community
Register/LoginCreated 05-23-2017 05:16 AM
Thanks for the answer. So I created metadata for my custom object in using rest api, then once I retrieved my event from spark streaming added as entity using rest api. So atlas will take care about lineage or do I need to add event modifications manually each and everytime?
Created 05-23-2017 02:03 PM
Take a look at the "Create Lineage amongst data sets" section (p. 46) in the document link I shared above. It also has a detailed example.
Created 05-24-2017 05:55 AM
yes. Got it @Eyad Garelnabi. Thanks
Created 08-05-2021 09:17 PM