01-17-2019 04:19 AM
Hive in CDH 5.14.2 is much faster than Hive in CDH 6.0.1. The same Hive SQL runs quickly on 5.14.2 but is very slow on CDH 6.0.1. Only queries that use count(distinct) have this issue. This question is similar to
01-19-2019 06:47 PM - edited 01-19-2019 06:54 PM
In our production environment I use Hive on Spark. Below is a simple SQL query, but it is too slow:
select count(1) as pv, count(distinct user_id) as uv from ods_action_d where dt>='2018-11-09' and dt<='2018-11-10' group by biz
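A workaround often suggested for slow count(distinct) queries is to deduplicate with an inner GROUP BY first and then count the resulting rows, so the distinct work is spread over a normal aggregation. The sketch below only checks that such a rewrite returns the same pv/uv as the query above. It runs against an in-memory SQLite table, not Hive, and the sample rows, the column types, the extra `biz` column in the SELECT list, and the helper name `run_both` are assumptions made for illustration. Whether the rewrite is actually faster on Hive on Spark in CDH 6.0.1 has not been measured here.

```python
import sqlite3

# Query shape from the post; biz is added to the SELECT list only so the
# two result sets can be compared row by row.
ORIGINAL_SQL = """
SELECT biz, COUNT(1) AS pv, COUNT(DISTINCT user_id) AS uv
FROM ods_action_d
WHERE dt >= '2018-11-09' AND dt <= '2018-11-10'
GROUP BY biz
ORDER BY biz
"""

# Two-stage rewrite: the inner query produces one row per (biz, user_id),
# the outer query sums the per-user counts (pv) and counts the non-NULL
# user_id values (uv), which matches COUNT(DISTINCT user_id).
REWRITTEN_SQL = """
SELECT biz, SUM(cnt) AS pv, COUNT(user_id) AS uv
FROM (
    SELECT biz, user_id, COUNT(1) AS cnt
    FROM ods_action_d
    WHERE dt >= '2018-11-09' AND dt <= '2018-11-10'
    GROUP BY biz, user_id
) t
GROUP BY biz
ORDER BY biz
"""

# Hypothetical sample data: includes a NULL user_id and a row outside the
# date range to exercise the edge cases.
SAMPLE_ROWS = [
    ("a", "u1", "2018-11-09"),
    ("a", "u1", "2018-11-10"),
    ("a", "u2", "2018-11-09"),
    ("a", None, "2018-11-10"),
    ("b", "u3", "2018-11-09"),
    ("b", "u3", "2018-11-08"),
    ("b", "u4", "2018-11-10"),
]


def run_both(rows):
    """Load rows into an in-memory table and run both query forms."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE ods_action_d (biz TEXT, user_id TEXT, dt TEXT)")
    conn.executemany("INSERT INTO ods_action_d VALUES (?, ?, ?)", rows)
    original = conn.execute(ORIGINAL_SQL).fetchall()
    rewritten = conn.execute(REWRITTEN_SQL).fetchall()
    conn.close()
    return original, rewritten


if __name__ == "__main__":
    original, rewritten = run_both(SAMPLE_ROWS)
    print(original)
    print(rewritten)
```

If the two result lists match on a sample of your real data, the rewritten SQL can be tried in Hive as-is (dropping the ORDER BY and the extra biz column if they are not needed) and its runtime compared with the original.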