Created on 02-06-2019 10:44 PM - edited 09-16-2022 07:08 AM
Hi,
We applied data redaction rules for email and credit card on HDFS services. But when we query on data in hive through shell or HUE UI, redaction policies are not applied on data and data is not redacted. We tried it on 5.8 version of Quickstart VM . Please update and revert.
Thanks
Divya
Created 02-07-2019 06:29 AM
Could you please put your steps here you did
Created 02-07-2019 07:06 AM
Hello Divya,
The purpose of Sensitive Data Redaction is to sanitize log files and query history or any other activitity that are stored outside of the database. It is not applied to the actual data in your database.
When data redaction is enabled, the following data is redacted:
For example, if you set the search rules to replace "\d{3}[^\w]\d{2}[^\w]\d{4}" with "XXX-XX-XXXX" and, as a user authorized to access the table "employees", you run the following query:
SELECT * FROM employees WHERE ssn = '123-45-6789'
The query will return the data you requested from the database. The data redaction will be applied to the query history in Hue which will save the query as:
SELECT * FROM employees WHERE ssn = 'XXX-XX-XXXX'
If your database contains sensitive information, to protect it:
Please see the blog post at https://blog.cloudera.com/blog/2015/06/new-in-cdh-5-4-sensitive-data-redaction/ and the documentation at https://www.cloudera.com/documentation/enterprise/5-8-x/topics/sg_redaction.html for more information.
Thanks!
David Wilder, Community Manager