I'm researching privacy products for a Big Data install. I've found many academic references to differential privacy (ε-differential, etc.) but I can't find a product in the Hortonworks stack that offers this. Is anyone aware of such a product, or if one is in the pipeline?
I'm aware of Privitar which deals with k-anonymity which would solve one of my issues but not the other.
Hi @Laura Ngo, that's an interesting topic but not so well-known in engineering circles. I just found some details here. There are only some elements available in HDP which might help you, like encryption of data at rest and giving permissions to access only certain columns in Hive and HBase tables, both provided by Ranger. Right away I cannot think of any other readily available, and even these would require further work to incorporate them into something more meaningful. Anyway, wishing you luck in your research!