Member since
08-17-2018
39
Posts
3
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
6000 | 08-28-2018 07:24 AM |
02-21-2020
07:52 AM
I actually did figure out how to solve this, at least in our case. Hopefully it's relevant to yours as well. See here.
... View more
08-26-2019
08:56 AM
1 Kudo
Just a note about the CM API: From what I can tell, the API doesn't bring more parameters back that what you can see in the Configuration tab for each app/service. Being able to see them in the INFO logs was exactly what was needed. However, it would be nice to be able to use the API to get the same info as the INFO logs provides. I could see some automation opportunities in the future...
... View more
08-22-2019
01:50 PM
Yes - that WIP links back to KUDU-1603 that I shared earlier. Guess, we will have to wait it out. Thanks for your response.
... View more
08-19-2019
06:22 AM
>"Your cluster does sound unhappy" LOL. I'd say more like pi$$ed. 🙂 > If it's a JVM issue, we've seen in some cases that increasing heap sizes helps. Setting ipc.client.rpc-timeout.ms to 60000 I'd say it's more like a set of overloaded namenodes. And, according to the research I've done so far, for another problem, I have a theory that I can use a more up to date GC and it should increase the performance and reduce the number of Zookeeper failures we have. Our ZK failures are sporadic and happen every 2 to 3 days. Sometimes more sometimes less. Moving ZK to separate nodes is not an option at this point and I'm doing all I can to try to reduce the number of failures short of moving the service. I'll check our settings on this and see if we can do one or both. I suspect we have increased JVM heap already, but not sure? >We've also seen the file handle cache that got enabled by default in CDH5.15 help a lot in reducing namenode load, I assume this is available before this version but was not enabled by default??? I'll look it up and see... > I agree 100%. I think whoever named it was either overly optimistic and assumed there wouldn't be a significant gap in time, or it was named from the point of view of the code rather than the external system So, my question is, is there an indicator in the Query Details that indicates something was returned? I know I get an initial set of results back. Without that "fetch" metric meaning what the word actually says, I don't know what indicates how long it took to get the first set of records back??? Back to the original issue... Given that the issue appears to be the last query issued in Hue tends to show up as still executing 2.2 hours later and has already returned a count almost immediately. Obviously, the parameters for idle timeouts for sessions and queries is not marking the session as closed. Therefore appearing to still be executing: Is this causing resource issues because the session is being held open and appearing to be still executing? I would assume so as it is waiting on a fetch of subsequent sets of records??? What parameter(s) will close the session from the last query executed? Just to let you know, I've come in late to the game and am still learning CM and Cloudera Manager. I understand a lot but with 1000s of parameters for all the apps and an ample supply of problems, it'll take a while. 🙂 Thanks for all your help. It is nice to have a response on this forum. The last couple posts were not very useful. We do have a service contract and although I am one of the 3 admins, they are working on adding me to the support contract so I can put in tickets and get support directly. Until then, I appreciate the help!
... View more
08-18-2019
09:47 PM
I updated this response to include more information, especially with some details not mentioned in the Hue page about what happens in Impala.
... View more
09-18-2018
11:00 AM
Yeah, they sold Powerschool off to a competitor who took it way past what we had. One of the Java engineers talked the new company into letting him re-write it in Java. It looks pretty cool from the last time I looked at it. I remember that group back in the day. I still have my email address and am using 50GB of iCloud storage for the day my Mac goes down. I bought a 2016 laptop in 2017 and absolutely love it. All except the memory. In which they fixed in the newest releases of MacBook Pro. Dang company! 🙂 Dude, I got some stories of sticking with something. I have about 33 years of Unix experience starting back in the day with Sun Microsystems servers. I own a Sun Sparc Classic and a Sun Enterprise 250 workgroup server. Both of them will still boot. I wil let you know if anything pops up as I am going to do some Django development as a phase II project to manage users automatically. I figured out how to web scrape the Hue page using the older user list URL (/useradmin/users/) that doesn't use Javascript to load the user list (/hue/useradmin/users/). Although, Hue warns the older URL is old, it still works. I have not figured out yet how to load the user list with the new (current) URL of "/hue/useradmin/users/" but will get back to it later. It would be nice to have an API for Hue as Cloudera Manager has. My project was to auto-delete Cloudera and Hue users that have left the company. I have all the prototype functionality built and am about to bring it all together into a package. Thanks for all your help!
... View more
08-28-2018
07:24 AM
@bgooley Got it! I'm relatively new to Python development and was trying figure out what was an acceptable form of multiple criteria for a single POST request for the same key. <== Maybe that sentence will help someone find this later on. I used tcpdump as well but could not see the POST data as it was encrypted with SSL. I was not able to turn off SSL to see plain text content as I don't have access to the server. I was exploring using a cert to decrypt but have never done that using tcpdump. I would love to know how, as I have run into trying to debug traffic and could not see the content. Anyway, I was having trouble creating a dict for the form data as it would only accept a single key. I didn't realize that an array(list) with a single dict key would work as well. This line was a problem in that Python would not let me add multiple key entries like "user_ids='4', user_ids='5'". I didn't realize all it needed was a list. form_data = dict(csrfmiddlewaretoken=session.cookies['csrftoken'] ,next='/hue/useradmin/users/', user_ids=['4','5']) I tried every combination I could think of but made the mistake of surrounding the arguments in quotes, making them a single text argument versus a list. What you did for me was verify a few things and put me on the right trail. Thanks to everyone for their contribution!
... View more