Created 11-19-2018 10:53 AM
Hello everybody,
I'm fairly new to Hive and Hadoop. I have a query that takes 10 minutes to complete.
The data size is 10 GB.
The table is partitioned and bucketed.
How can I improve the query below?
select * from tbl1 where clmn='Abdul' and loc='IND' and TO_UNIX_TIMESTAMP(ts) > (UNIX_TIMESTAMP() - 5*60*60);
I have already tried the following settings:
set hive.vectorized.execution.reduce.enabled=true;
set hive.tez.container.size=8192;
set hive.fetch.task.conversion=none;
set mapred.compress.map.output=true;
set mapred.output.compress=true;
None of these parameters made the query finish any faster.
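In case it helps anyone looking into this, here is a minimal sketch of statements that could be used to inspect the execution plan and the table layout. It uses only the table and column names from the query above. Which columns are actually the partition and bucketing keys is not stated in this post, so nothing here assumes that, and no output is shown because it depends on the cluster.

```sql
-- Show the execution plan for the slow query, to see whether
-- Hive prunes partitions or scans the whole table.
EXPLAIN
select * from tbl1
where clmn='Abdul' and loc='IND'
  and TO_UNIX_TIMESTAMP(ts) > (UNIX_TIMESTAMP() - 5*60*60);

-- List the partitions that exist for the table.
SHOW PARTITIONS tbl1;

-- Show partition columns, bucketing columns, bucket count and storage format.
DESCRIBE FORMATTED tbl1;
```

The time filter keeps rows whose ts falls within the last 5*60*60 = 18000 seconds, that is, the last 5 hours.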