Member since
06-26-2015
515
Posts
138
Kudos Received
114
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 2271 | 09-20-2022 03:33 PM | |
| 6048 | 09-19-2022 04:47 PM | |
| 3260 | 09-11-2022 05:01 PM | |
| 3734 | 09-06-2022 02:23 PM | |
| 5824 | 09-06-2022 04:30 AM |
03-25-2022
10:29 PM
Woah! CDH 5.4.3 is *really* old. Unfortunatelly I don't have a cluster running that version here to test. Hive has come a long way since then. We're already using Hive 3 on CDP 7.x. I'd recommend you upgrade your system, if possible. Cheers, André
... View more
03-25-2022
10:25 PM
@Boss , These are upper bound values to ensure that the services running on the machine won't run into limitations on the number of processes or open file descriptors. IMO, these are really pertinent parameters when you have gateway servers where tens or hundreds users connect to to run their own processes and you want to make sure no single user will run rogue processes that will starve everyone else of resources. The hosts in a CDP cluster enviroment are typically not hosts where users should be connecting directly to. The services and processes that run on those hosts are well known and managed by the administrator. In this scenario, these parameter are not as critical and we usually set them to a value that get them "out of the way", so that that we never reach them. Specifically to answer your question, though: "nofile" is the limit of open file descriptors. Note that file descriptors are not only associated to files; for example, they are also used to refer to open network sockets/ports and pipes. You can check the file descriptors currently open using the command "lsof" "nproc" is the limit of running processes. You can check that with the command "ps". Cheers, André
... View more
03-25-2022
01:35 PM
How did you create the impala_test principal?
... View more
03-25-2022
05:43 AM
1 Kudo
@pandu2022 , Please check the servicePrincipalName (SPN) property of the AD user. It should be impala_test/<host>@realm. André
... View more
03-24-2022
10:48 PM
@Aditya-Moghe , Would you be able to better explain what you're trying to achieve? You post is not very clear. Cheers, André
... View more
03-24-2022
05:18 PM
@AndreDre1 , did the above answer your question? André -- Was your question answered? Please take some time to click on "Accept as Solution" below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
... View more
03-24-2022
05:17 PM
1 Kudo
@pandu2022 , The KDC does not need to connect to Impala servers. Do you happen to have multiple realms in your environment with cross-realm trust configured between them? Could you please run the below commands and share the output? kinit <your_user>
kvno impala/<host_fqdn>@<REALM>
kvno impala_test/<host_fqdn>@<REALM> Cheers, André -- Was your question answered? Please take some time to click on "Accept as Solution" below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
... View more
03-24-2022
05:01 PM
@wazzu62 , "Connection refused" errors usually means that the server is reachable but not accepting connections on that port. If it was a firewall issue you should've seen a "connection timeout" type of error. This might mean that either (a) the service running on the target server is not actually running or (b) it has been configured with a different port. Have you tried telneting into that port to check if it's open at all? Try this locally from the server and remotely from where the ATS is running. Cheers, André -- Was your question answered? Please take some time to click on "Accept as Solution" below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
... View more
03-24-2022
04:02 PM
@CJ-Llanes , The expression ${id} refers to a flowfile attribute called "id", not to the "id" attribute of your flowfile content. You need to extract the "id" from the flowfile first with a EvaluateJsonPath between your FlattenJson and PutDynamoDB processors. Like this: Cheers, André -- Was your question answered? Please take some time to click on "Accept as Solution" below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
... View more
03-24-2022
03:50 PM
@mystefied_ , Which version of Hive are you using? That query works well on my cluster. Nevertheless, you should be able to run the below, which is pretty much the same: select
yr,
mth,
month_total,
month_total / lag(month_total, 1) over (order by yr, mth) as percentage_over_previous_month,
sum(month_total) over (order by yr, mth) as running_sum,
sum(month_total) over (order by yr, mth) / sum(month_total) over (partition by 1) as running_percentage
from (
select
year(stock_date) as yr,
month(stock_date) as mth,
sum(stock_price) as month_total
from table4
group by year(stock_date), month(stock_date)
) x Cheers, André -- Was your question answered? Please take some time to click on "Accept as Solution" below this post. If you find a reply useful, say thanks by clicking on the thumbs up button.
... View more