Member since
10-08-2018
2
Posts
0
Kudos Received
0
Solutions
10-16-2018
04:33 PM
Use Bucketing using ID and use sort , also store as ORC file format and compress and set hive.exec.orc.split.strategy=BI; Example : CREATE
TABLE Employee( ID BIGINT, NAME
STRING, AGE INT, SALARY
BIGINT, DEPARTMENT
STRING ) COMMENT
'This is Employee table clustered by id sorted by age into 5 buckets' CLUSTERED
BY(ID) SORTED BY(AGE)INTO 5 BUCKETS STORED
AS ORC tblproperties (“orc.compress” = “SNAPPY”); Let me know if you face any issue
... View more