APAI am about to upgrade from cdh to cdp and I have some questions regarding new version of Hive.
Until now I used to have hive as etl service because it is more stable but slower than impala. My tables that bi users see are in impala.
My questions are:
1) Is hive 3 fast enough to compete impala ?
2) In case of bi use is it more appropriate to point hive or impala(I read that hive 3 uses cache and makes bi repeated requests faster)?
3) In case of kafka flow, is it appropriate to create an acid table in hive 3 and store the fetched data live ?