Created on 05-08-2018 03:18 AM - edited 09-16-2022 06:11 AM
Hi All,
I have one table. When I compare the query result from impala & hive, i found different value for some column.
The detail is like below
that is same record, but for some column the value is not same.
please help me if you has any advice
Note: sorry for my english
Thanks
Created 05-08-2018 03:48 AM
Created 05-08-2018 08:44 PM
thanks for your answer.
I have run that script & refresh [[dbname.]tablename], unfortunately, the result still same.
Created 05-09-2018 10:13 AM
There could be two possibilities for this scenario
1. May be the hive and impala tables are referring to the two different files. But chances are less for this scenario unless any minor mistakes in the tables (or) some other internal error
You can confirm this by
> describe formatted db.tablename
Run this command from both hive and impala and get the location and compare
2. Your file has duplicate records. I mean some key values are same but other columns may have different value. So it may return different value when you filter. So check your data in detail
Created 05-10-2018 03:31 PM
To debug wrong results, it's very helpful for us to get an Impala query profile of the query that returns wrong results. Would you be able to provide that to help us debug?
Created 05-10-2018 07:54 PM
@alex.behmwrote:To debug wrong results, it's very helpful for us to get an Impala query profile of the query that returns wrong results. Would you be able to provide that to help us debug?
Please see on this url for impala query profile.
Thanks
Created on 05-10-2018 07:43 PM - edited 05-10-2018 07:53 PM
1) I have compare it & it is in the same location
2) actualy, value from column x in hive, show in column y in impala, please see below image for detail
base on above image, we can see. value of n i p column in hive, show in am_name_senior in impala.
Thanks