Hive INSERT query into a partitioned table fails with vectorization enabled

Hello everyone,

A Hive INSERT query into a partitioned table is failing with vectorization enabled. It was working fine until last week, and it still runs fine with vectorization disabled. Can someone help identify the issue?

Please find attached the logs with vectorization enabled and disabled (hive-logs.txt).

1 ACCEPTED SOLUTION

Hi @Jalender!
Looking at your logs, it seems that Hive with vectorization enabled cannot process a date value.
I guess you're hitting this: https://issues.apache.org/jira/browse/HIVE-9862
There is a clearer and more detailed explanation in the Limitations section here:
https://cwiki.apache.org/confluence/display/Hive/Vectorized+Query+Execution

Could you confirm whether any of your dates are out of bounds?
Hope this helps!

REPLIES

@Vinicius Higa Murakami Thanks for your reply.

If we enable vectorization, does it corrupt timestamp values every time, or is it intermittent? It was working fine with vectorization enabled, and we only started seeing this recently.

Hey @Jalender!
Actually, it should work fine.
You will only face issues when a datetime value falls outside the range given in that link, 1677-09-20 to 2262-04-11. So if you have any datetime earlier or later than this range, you will hit that failure message.
Hope this helps!
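
For anyone wondering where that range comes from: as I read the Limitations section of the wiki, a vectorized timestamp is stored as a signed 64-bit count of nanoseconds relative to the Unix epoch. The Python sketch below only reproduces that arithmetic. It is not Hive code, and the names (`EPOCH`, `nanos_to_datetime`) are made up for illustration.

```python
from datetime import datetime, timedelta, timezone

# Assumption: timestamps are held as signed 64-bit nanoseconds since the Unix epoch.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
INT64_MIN = -2**63
INT64_MAX = 2**63 - 1

def nanos_to_datetime(nanos: int) -> datetime:
    """Convert nanoseconds since the epoch to a UTC datetime.

    timedelta only has microsecond resolution, so the value is floored
    to whole microseconds before the conversion.
    """
    return EPOCH + timedelta(microseconds=nanos // 1000)

earliest = nanos_to_datetime(INT64_MIN)
latest = nanos_to_datetime(INT64_MAX)

# Smallest and largest instants representable under the assumption above.
print(earliest.date(), latest.date())
```

In UTC this prints 1677-09-21 and 2262-04-11, which lines up with the documented bounds give or take a day at the lower end, so anything outside that window simply cannot be represented in that storage format.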

You don't really need to turn off vectorization; you just need to clean up any dates outside the range supported by vectorization 🙂
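
If it helps with the cleanup, here is a rough Python sketch for spotting suspicious values in an exported sample of the column before fixing them in the table. The bounds are the two dates quoted above; the function name, the sample values, and the `yyyy-MM-dd HH:mm:ss` string format are assumptions for illustration, not anything taken from your logs.

```python
from datetime import datetime

# Bounds as quoted in this thread (from the Hive wiki's Limitations section).
LOWER = datetime(1677, 9, 20)
UPPER = datetime(2262, 4, 11)

def out_of_range(values, fmt="%Y-%m-%d %H:%M:%S"):
    """Return the input strings whose timestamp lies outside [LOWER, UPPER].

    UPPER is midnight on 2262-04-11, so later times on that day are
    flagged as well, which errs on the side of caution.
    """
    bad = []
    for raw in values:
        ts = datetime.strptime(raw, fmt)
        if ts < LOWER or ts > UPPER:
            bad.append(raw)
    return bad

# Hypothetical sample: one ordinary value and two typical placeholder dates.
sample = ["2018-06-01 12:00:00", "0001-01-01 00:00:00", "9999-12-31 23:59:59"]
print(out_of_range(sample))
```

Placeholder values such as 0001-01-01 or 9999-12-31 are common candidates, and a single newly loaded row like that would be enough to explain a job that worked until recently.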