We're evaluating tagging and metadata support as a way to build an overview of the Hive tables, files, and other data sources that contain personally identifiable information.
Based on this, we have attached 5 key questions.
Any insight into how to perform data management using metadata and/or tagging on a Hadoop 2.6.4 distributed file system would be highly appreciated.