Member since
08-10-2017
6
Posts
0
Kudos Received
0
Solutions
02-07-2019
03:12 PM
I have tried with the following parms hive.tez.auto.reducer.parallelism=true; hive.tez.min.partition.factor=0.25 hive.tez.max.partition.factor=2.0 set hive.exec.reducers.bytes.per.reducer = 134217728; My output is of size 2.5 GB (2684354560 bytes) and based on the formula given above, i was expecting max(1,min(1099,2684354560/ 134217728))*2 = max(1,min(1099,20))*2 = max(1,20)*2 = 40 reducers. but my query was assigned only 5 reducers, i was curious why? and are there any other parameters that can reflect the no. of reducers. below is the query that i am using : truncate table target_tab ;
INSERT INTO TABLE target_tab
SELECT * FROM src_tab WHERE 1=1 ORDER BY a, b,c
... View more
08-10-2017
09:11 AM
hive> select cast(12345678910.231 as double); 1.2345678910231E10 Time taken: 0.164 seconds, Fetched: 1 row(s) but need the value 12345678910.231 stored as double. select cast(cast(12345678910.231 as double) as double); also wont work and is similar as the above. I dont need the output format to be change using the printf() UDF, rather i need the value to be stored and retrieved while querying as a non exponential form.
... View more
Labels:
- Labels:
-
Apache Hive