Member since
07-16-2020
6
Posts
0
Kudos Received
0
Solutions
08-04-2020
07:45 AM
Hi, is it possible to setup Oozie in Hue in such a way that: 1. It peforms a query automatically on a daily basis 2. Send these results to a group of people as an eMail attachment Unfortunately I have not found any material how to setup Oozie in a useful fashion. I guess this would be useful for the beginning. Thanks, KS
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Oozie
07-24-2020
08:22 AM
Hi, I'm using the Impala function "show table stats": show table stats table1; I get following results business_date tec_execution_date #Rows #Files Size Bytes Cached Cache Replication Format 13/05/2020 13/05/2020 20:08 0 0 0B NOT CACHED NOT CACHED PARQUET 14/07/2020 15/07/2020 16:39 6 1 16.08KB NOT CACHED NOT CACHED PARQUET 15/07/2020 16/07/2020 16:07 770 2 114.68KB NOT CACHED NOT CACHED PARQUET 16/07/2020 20/07/2020 16:50 272 1 41.88KB NOT CACHED NOT CACHED PARQUET 17/07/2020 21/07/2020 12:38 290 1 45.33KB NOT CACHED NOT CACHED PARQUET 20/07/2020 20/07/2020 12:16 379 1 51.10KB NOT CACHED NOT CACHED PARQUET 21/07/2020 22/07/2020 12:45 156 1 31.50KB NOT CACHED NOT CACHED PARQUET 24/07/2020 24/07/2020 15:15 278 1 40.26KB NOT CACHED NOT CACHED PARQUET Total 2431 8 340.84KB 0B However if I sum up the rows in column "#Rows" I get 2151...! Am I'm missing something? How is that possible? Thanks Thanks
... View more
Labels:
- Labels:
-
Apache Impala
07-16-2020
05:27 AM
Hi, Is there a way to get a list of columns with non-missing values? Note that I got more then 2000 columns, so check every column manually is not possible. I tried to use metadata: ANALYZE TABLE test COMPUTE STATISTICS FOR COLUMNS;
DESCRIBE FORMATTED test; However this is not really helpful because it does not give me useful information on how the columns are actually filled. For the beginning, an output table as described in https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-ExistingTables. using the command desc formatted concurrent_delete_different partition(ds='tomorrow') name; would be enough. Unfortunately, this command doesn't work using Hue. Any idea?
... View more
Labels:
- Labels:
-
Apache Hive
-
Cloudera Hue