Member since
06-26-2018
32
Posts
3
Kudos Received
0
Solutions
12-19-2019
05:07 AM
Thanks @lwang, Your diligence is greatly appreciated! Looking forward to a response as this folder is growing quite out of control. Greg Frair
... View more
12-11-2019
08:56 AM
For clarification as well, there are two directories within "/user/hue/oozie": - deployments - workspaces I'm thinking deleting files within each of these will have their own set of implications. Thanks again for looking into this! Greg
... View more
12-07-2019
04:53 AM
Thanks Li, Can you provide some additional information? If I was to delete all files in this folder over 7 days, should I not cause any issues? Do these workspaces only get created by oozie when a job is about to start? What about re-occurring jobs, do any rely on these source files? Any assistance is greatly appreciated. Thanks, Greg
... View more
12-05-2019
10:45 AM
Good day,
We are in the process of tackling small file issues compounded in a variety of areas in HDFS. One of these areas is "/user/hue/" (which currently has over 2000 small files in it). I'm wondering what are the impacts of clearing this directory? I was under the impression that everything "Hue dependent" was stored in the Hue database, but I also want to ensure that I don't break anything (specifically in /user/hue/oozie, but any "break" is a bad break).
Any advice is greatly appreciated.
Can't seem to locate any documentation on this.
Thanks,
Greg Frair
... View more
Labels:
- Labels:
-
Cloudera Hue
02-15-2017
07:43 PM
Also, I can't speak to the overall stability of CDH, as I definitely haven't performed a lot of activity on it... I just know that I didn't experience this particular issue, which was a little unerving for me.
... View more
02-15-2017
07:40 PM
I'll give it a try and let you know how it works... I'll accept your answer in the mean time as it looks promising. Thanks!
... View more
02-11-2017
11:45 PM
Currently, I need it running for my purposes... is there any way that I can turn off logging globally and only turn it on for troubleshooting? I was using the cloudera sandbox and had it running for 3 months straight without this issue, so it's a little concerning to me that this could be present even if/when we productionalize. Also, any ideas how I can confirm whether it's caused by logging or not?
... View more
02-11-2017
11:45 PM
Currently, I need it running for my purposes... is there any way that I can turn off logging globally and only turn it on for troubleshooting? I was using the cloudera sandbox and had it running for 3 months straight without this issue, so it's a little concerning to me that this could be present even if/when we productionalize. Also, any ideas how I can confirm whether it's caused by logging or not?
... View more
02-11-2017
06:35 PM
1 Kudo
Good day, I've installed the HDP 2.5 sandbox to try out on my laptop and found that it was continuing to allocate space slowly (without me doing any real activity, such as loading in data) until my hard drive ran out of space. I then found a server that had a 100GB partition which I installed it on and found the same behavior. It took a little over a week, but without any real activity occurring (that I was aware of) on the VM, it eventually just ran out of 100GB of space and crashed the VM... forcing me to do a re-install. Does anyone have any idea why it would just continually allocate space, as I can't figure out what is growing? I would like to turn it off so that I don't have to keep re-installing the VM. It's running on VMware if that helps. I believe I installed it on Virtual Box when I set it up on my laptop and experienced the same behavior. Any assistance is greatly appreciated! Greg
... View more
Labels:
- Labels:
-
Hortonworks Data Platform (HDP)