I am using all current parcels CM 4.8 CDH 4.5 and Impala 1.2.2
Steps to reproduce:
- start clean, so checek in CM, impala -> queries, this shows clean, no running queries.
- go on Hue and Impala, submit a simple query like select * from tableA limit 1. Notice the result is displayed correct.
- go back to CM, impala -> Queries, here I see 4 running queries:
2. USE default
3. select * from tableA limit 1
Sure these might all be triggered by the simple query I submitted, but the query is done. Why are they showing running. Matter the fact, they will continue to show as running in 30 mins, next day and so on. Until impala is restarted.
Not sure if this is related, but when I run the same query with limit 1000, I get an ArrayIndexOutOfBoundsException. Anyone has experienced this? Please help. I don't think it is the data because Hive runs fine. The tableA is in HBase and is an external Hive table.
Unfortunately, this is a known issue with Hue not closing the various queries it issues to Impala. The queries remain open as can be seen in CM as well as the Impala web UIs. See https://issues.cloudera.org/browse/HUE-1455 and https://issues.cloudera.org/browse/HUE-994. I know these are both fixed in the upcoming CDH5 release, but am not sure if they have both made it into a Hue in the CDH4 line. I am not aware of any workaround.
Can this issue also causing the ArrayIndexOutofBondException that is described in my original post?