PROBLEM : Query fails with the below mentioned exception.
ERROR :
java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: java.io.IOException: java.lang.NullPointerException
at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.initNextRecordReader(TezGroupedSplitsInputFormat.java:192)
at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.<init>(TezGroupedSplitsInputFormat.java:131)
ROOT CAUSE :
Query fails on a field of datatype timestamp within an ORC table having null values, provided Predicate push down is enabled.
This is a known issue. Reference: https://hortonworks.jira.com/browse/BUG-47472
WORKAROUND / RESOLUTION :
This setting is whether to enable automatic use of indexes or not. There could come with some performance penalties.