Member since: 07-16-2018
Posts: 9
Kudos Received: 0
Solutions: 1
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2038 | 07-19-2018 05:54 AM |
07-29-2021 08:00 AM
That's what I was afraid of. I'm using Cloudera Manager to find queries where missing_stats=true and working through them more or less one by one. Thanks for the response!
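For anyone taking the same one-by-one route, here is a minimal sketch of the per-table check in Impala SQL. The name my_db.my_table is a placeholder, not a table from this thread, and this assumes the account is allowed to run COMPUTE STATS.

```sql
-- Placeholder name: replace my_db.my_table with a table flagged as missing stats.

-- Check the current state; #Rows = -1 means table stats have not been computed.
SHOW TABLE STATS my_db.my_table;

-- Gather table and column statistics.
COMPUTE STATS my_db.my_table;

-- Confirm that #Rows now shows a real row count.
SHOW TABLE STATS my_db.my_table;
```

This still has to be repeated per table; it only replaces scanning the longer describe formatted output.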
07-28-2021 08:10 AM
I've read query plans in SQL Server forever using a GUI, but text plans are like Greek to me (I don't speak Greek). I'm trying to figure out if statements like this calculate the date "constant" for each row and cause performance lag, or if Impala knows to calculate it once and apply it to all rows.
select *
from myTable
-- the "months_add" is basically a constant, but I can see
-- why a function might be calculated for each row
where tran_timestamp > months_add(now(), -1)
limit 1;
select *
from myTable
-- but what about this? this is definitely a constant
-- so is this different than the query above and is
-- the "constant" calculated for each row?
where tran_timestamp > months_add('2021-01-01', -1)
limit 1
Labels: Apache Hadoop, Apache Impala
07-28-2021 08:03 AM
I'd like to update stats (compute stats) for every table I have. I have a ton of queries that are running without them. I personally have access only to Hue for Impala, with no console access to the server. I can see the status of stats for a table with describe formatted <table>, but the output is a major pain to look through, and I have a ton of tables.
Is there any way to get a list of all tables and the status of their stats from Hue?
Thanks!
Labels: Apache Hadoop, Apache Impala, Cloudera Hue
05-31-2019 10:29 AM
Not helpful yet, but promising... PIVOT keyword is reserved for future use! https://www.cloudera.com/documentation/enterprise/6/6.2/topics/impala_reserved_words.html
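07-19-2018 05:54 AM
Turned out to be pretty simple. Hive (at least) needs two backslashes in the pattern, \\d rather than \d, because a single backslash inside a string literal is consumed as an escape character before the regex engine sees it. I ended up using this, which also limits the match to one to four digits:
[(]\\d{1,4}[)]$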
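Applied to the query from the original question, the fix might look like the sketch below. It keeps the leading space from the question's pattern so the space before the parenthesis is replaced as well; that detail is carried over from the question, not from the pattern above.

```sql
-- Same test literal as in the question. The only changes are the doubled
-- backslash (\\d) and the {1,4} digit limit.
select 'SP0724_FA (1)' as str,
       regexp_replace(trim('SP0724_FA (1)'), ' [(]\\d{1,4}[)]$', 'x') as str_2;
```

The value the question was looking for in str_2 is SP0724_FAx.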
07-16-2018 06:22 AM
I'm executing the following query in Hive. Many of the field values in my actual table look like: ASDF (1) abc 123 xyz (10) etc. I just want to replace any digits in parentheses at the end of a string with "x".
select 'SP0724_FA (1)' as str,
       regexp_replace(trim('SP0724_FA (1)'), ' [(]\d+[)]$', 'x') as str_2
The online testing app at regex101.com matches this. However, Hive returns the same value for str and str_2. I would expect to see "SP0724_FAx" as the str_2 value. Please note the space at the beginning of the regex pattern. Any ideas? Thank you!
Labels: Apache Hive