I don't have an end-to-end example or script flow for my use case, but I have an idea in mind for "automatic tag detection", similar to what the "waterlinedata" tool does.
I am looking for an existing Hortonworks tool or library that will analyze my data and suggest tags, i.e. the tool or library learns from the data and recommends a tag.
For example, suppose I have an employee dataset with two columns, "SSN" and "date_of_birth". The library or tool would learn from this dataset and suggest that these columns be tagged as PII (Personally Identifiable Information).
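To make the idea concrete, here is a rough sketch of the kind of behaviour I mean. It is plain Python with simple pattern rules, not a Hortonworks feature and not real learning. The function name `suggest_tags`, the two regular expressions, the 0.8 threshold, and the sample values are all made up for illustration.

```python
import re

# Hypothetical patterns for illustration only; a real detector would need
# many more rules (or a trained model) and would produce false positives.
SSN_RE = re.compile(r"^\d{3}-\d{2}-\d{4}$")    # e.g. 123-45-6789
DATE_RE = re.compile(r"^\d{4}-\d{2}-\d{2}$")   # e.g. 1985-02-17

def suggest_tags(columns, threshold=0.8):
    """Return {column_name: "PII"} for columns whose non-empty values
    mostly match one of the patterns above."""
    suggestions = {}
    for name, values in columns.items():
        non_empty = [v for v in values if v]
        if not non_empty:
            continue  # nothing to inspect in this column
        for pattern in (SSN_RE, DATE_RE):
            hits = sum(1 for v in non_empty if pattern.match(v))
            if hits / len(non_empty) >= threshold:
                suggestions[name] = "PII"
                break
    return suggestions

# Made-up sample data shaped like the employee dataset described above.
employees = {
    "SSN": ["123-45-6789", "987-65-4321"],
    "date_of_birth": ["1985-02-17", "1990-11-03"],
    "department": ["sales", "engineering"],
}
print(suggest_tags(employees))
```

With this sample data, "SSN" and "date_of_birth" are suggested as PII and "department" is not. Note that the sketch flags any date-shaped column, so it is only a starting point for the question, not a solution.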
Is this possible in Hortonworks, or with any other tool or library?
I think we can achieve the same thing with the Python scikit-learn library, but can it be done with a built-in Hortonworks algorithm?
Thanks in advance.