- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Tools to tag data in HDFS
- Labels:
-
Apache Atlas
Created ‎02-24-2017 11:07 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi All, just wanted to get your input regarding what tools (preferably open source) are available that can tag data in HDFS; from what I understand Atlas is not able to do that. Heard about Waterline, any others that you guys know of.
Thanks.
Created ‎02-24-2017 11:10 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
waterline has feature called hdfs crawler which uses a algorithm to tag data. Attivio is another tool which can tag data based on a data mart concept. Both tools are best in class in my opinion.
Created ‎02-24-2017 11:10 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
waterline has feature called hdfs crawler which uses a algorithm to tag data. Attivio is another tool which can tag data based on a data mart concept. Both tools are best in class in my opinion.
Created ‎02-26-2017 12:33 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Here is the link to a short tutorial with Waterline using a HDP Sandbox created jointly by Hortonworks and Waterline:
Manage your Data Lake more efficiently with Waterline Data Inventory and HDP
