Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Tools to tag data in HDFS

avatar
Expert Contributor

Hi All, just wanted to get your input regarding what tools (preferably open source) are available that can tag data in HDFS; from what I understand Atlas is not able to do that. Heard about Waterline, any others that you guys know of.

Thanks.

1 ACCEPTED SOLUTION

avatar
Master Guru

waterline has feature called hdfs crawler which uses a algorithm to tag data. Attivio is another tool which can tag data based on a data mart concept. Both tools are best in class in my opinion.

View solution in original post

2 REPLIES 2

avatar
Master Guru

waterline has feature called hdfs crawler which uses a algorithm to tag data. Attivio is another tool which can tag data based on a data mart concept. Both tools are best in class in my opinion.

avatar
Master Guru

Here is the link to a short tutorial with Waterline using a HDP Sandbox created jointly by Hortonworks and Waterline:

Manage your Data Lake more efficiently with Waterline Data Inventory and HDP