Reply
Posts: 394
Topics: 11
Kudos: 60
Solutions: 35
Registered: ‎09-02-2016

Cloudera Navigator - Identify unused db, tables, files, folders, etc

Hi

 

Does Cloudera navigator has an option to identify Unused objects for a particular period (like more than 6 months, 1 year, etc)?

 

The object can be HDFS files, Hive/Impala tables/Oozie, dataset, etc

 

This is my requirement: Our non-prod environment has been used by multiple users for different reasons like dev, test, etc. Sometimes they use common user id & user space to create db, create/import tables, etc. After the task finished, they will move to the next task without cleaning the old DB, tables, files  which become garbage after few days. 

 

It has been accumulated and become a big garbage now (with 3 replication). I want to identify the DB, tables, files which are not in use for more than 6 months (or) 1 year and delete them (with proper approval...)

 

Is it possible with Navigator? is there any other option/ideas?

 

Thanks

Kumar

 

Posts: 394
Topics: 11
Kudos: 60
Solutions: 35
Registered: ‎09-02-2016

Re: Cloudera Navigator - Identify unused db, tables, files, folders, etc

upon further analysis, i've noticed that "navigator policies" might help on this

 

https://www.cloudera.com/documentation/enterprise/5-5-x/topics/navigator_policies.html

 

It seems that I need to write search query, let me try to write one... In the mean time, it will be great if some share the query for the above scenario...

 

 

New Contributor
Posts: 1
Registered: ‎03-18-2016

Re: Cloudera Navigator - Identify unused db, tables, files, folders, etc

Anyone had luck getting this query right? Please share some examples.

 

Thanks
Rahul

Announcements
New solutions