I have one table. When I compare the query results from Impala and Hive, I find different values for some columns.
The details are below.
It is the same record, but for some columns the values are not the same.
Please help me if you have any advice.
Note: sorry for my English.
There are two possible causes for this scenario.
1. The Hive and Impala tables may be referring to two different files. This is unlikely, though, unless there is a minor mistake in the table definitions or some other internal error.
You can confirm this by running:
> describe formatted db.tablename
Run this command from both Hive and Impala, then compare the location each one reports.
2. Your file has duplicate records, meaning some key values are the same but the other columns have different values. In that case a filtered query may return a different value in each engine, so check your data in detail.
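One way to check for the duplicates described in point 2 is to group by the key and look for keys that appear more than once. This is only a minimal sketch: `key_col` is a hypothetical column name (the real key was not given in the thread), and `db.tablename` is the same placeholder used above. The query should run unchanged in both Hive and Impala.

```sql
-- key_col is a hypothetical name: replace it with the column (or columns)
-- that should uniquely identify a record in your table.
SELECT key_col, COUNT(*) AS row_count
FROM db.tablename
GROUP BY key_col
HAVING COUNT(*) > 1;
```

If this returns no rows, the key is unique and duplicates are not the cause of the mismatch.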
To debug wrong results, it's very helpful for us to get an Impala query profile of the query that returns wrong results. Would you be able to provide that to help us debug?
1) I have compared the locations, and they are the same.
2) Actually, the value from column x in Hive shows up in column y in Impala. Please see the image below for details.
Based on the image above, we can see that the value of the n i p column in Hive shows up in am_name_senior in Impala.