Member since
09-26-2017
24
Posts
0
Kudos Received
0
Solutions
04-04-2018
07:26 AM
Hi @Joy Ndjama, Awesome ! Exactly what I was expecting. Even if it is quite expensive, it is a elegant way to get a true sample. Thanks @Scott Shaw as well, TABLESAMPLE is a very interesting functionnality too.
... View more
01-06-2018
08:39 AM
Hi @Gunther Hagleitner ; thanks it's very clear with your explainations.
... View more
01-01-2018
11:52 AM
thanks @Bala Vignesh N V ; it helps 🙂
... View more
12-17-2017
04:43 PM
Thanks a lot @bkosaraju it really helps me 🙂
... View more
11-09-2017
09:55 AM
Thanks it helps. before OVERWRITE : $ hdfs dfs -ls /apps/hive/warehouse/xyz.db/table_tmp
Found 1 items
718 2017-11-09 10:18 /apps/hive/warehouse/xyz.db/table_tmp/000000_0 during OVERWRITE : $ hdfs dfs -ls /apps/hive/warehouse/xyz.db/table_tmp
Found 2 items
0 2017-11-09 10:35 /apps/hive/warehouse/xyz.db/table_tmp/.hive-staging_hive_2017-11-09_10-35-38_682_2619781700846007196-1
718 2017-11-09 10:18 /apps/hive/warehouse/xyz.db/table_tmp/000000_0
after OVERWRITE : $ hdfs dfs -ls /apps/hive/warehouse/xyz.db/table_tmp
Found 1 items
718 2017-11-09 10:35 /apps/hive/warehouse/xyz.db/table_tmp/000000_0 What I understand is that a query running (involving the file in example), for example, since 10:15 and still executing at 10:35 does not garantee a good execution (but I can presume the file, especially because it is small here, will have already been processed in a first stage of the M/R process). Is that so ? I am wondering if OVERWRITE is a good way to build intermediate table in this case... Without LOCK functionnality enabled, do you suggest a better way ?
... View more
10-09-2017
07:24 AM
thanks 🙂
... View more
10-02-2017
01:29 PM
Thanks a lot for your reply. But, through JMX, will I be able to monitor a particular query ? Or just the global activity of the JVM ?
... View more