Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Hive scratch data - Track the process that created it

Highlighted

Hive scratch data - Track the process that created it

New Contributor

Hello:

I have an HDFS cluster where i get temporary Storage alerts that sometimes last hours.

When tracking the space consumption, i usually end up in the tmp/hive folder.

It looks like some job creates a very big amount of scratch data.

Is there any way to track the data to the hive job? Usually, if i go to the tez view, there are no jobs running, and it looks like most of the time, the temporary directory is not locked, for write.

I am running a HDP 2.6.4 cluster.

Thanks!

Regards

Don't have an account?
Coming from Hortonworks? Activate your account here