Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Add metadata automatically

Rising Star

Hello,

I am looking for a way to do the following:

New data arrives in a certain folder on HDFS, and the data may contain metadata such as names and emails. I would like to use this metadata to classify my data.

I know that you can define tags in Falcon, but that is not automatic.

Is there a way to automatically use the metadata contained in a file, for example by transferring it to Falcon or another tool? The goal is to use a search tool (like the one in Falcon).

1 ACCEPTED SOLUTION

Master Mentor
@Arthur GREVIN

Take a look at this: https://community.hortonworks.com/content/repo/5240/hadoop-data-pipeline-using-falcon.html

Atlas is the solution for this, but as of now there is a hook for Hive only. Work on HDFS support is in progress.
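
Until an HDFS hook is available, one possible stopgap is to extract the metadata yourself and keep a small tag index next to the data. The following is only a minimal sketch under stated assumptions, not an Atlas or Falcon API: the helper names `extract_tags` and `build_index` are hypothetical, only email addresses are matched (with a deliberately simple pattern), and the file contents are assumed to have been read from HDFS already (for example with `hdfs dfs -cat`).

```python
import re
from collections import defaultdict

# Simple illustrative pattern for email addresses; not a full RFC 5322 matcher.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_tags(text):
    """Return the sorted, de-duplicated, lowercased email addresses in text."""
    return sorted({match.lower() for match in EMAIL_RE.findall(text)})


def build_index(files):
    """Map each tag to the sorted list of file paths whose content contains it.

    `files` is a dict of {path: file content as a string}. How the content is
    fetched from HDFS is outside this sketch.
    """
    index = defaultdict(list)
    for path, text in files.items():
        for tag in extract_tags(text):
            index[tag].append(path)
    return {tag: sorted(paths) for tag, paths in index.items()}


if __name__ == "__main__":
    # Hypothetical paths and contents, for illustration only.
    sample = {
        "/data/in/a.txt": "name=Bob; email=Bob@Example.com",
        "/data/in/b.txt": "contact: bob@example.com, amy@example.org",
    }
    for tag, paths in sorted(build_index(sample).items()):
        print(tag, paths)
```

The resulting index could then be used to set tags by hand or be loaded into whatever search tool you choose; that part depends on the tool and is not covered here.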


2 REPLIES

Rising Star

Thank you for your answer.