Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Hawq Queries Very Slow

Highlighted

Hawq Queries Very Slow

New Contributor

Hello

Im trying Pivotal Hawq with ambari and now im trying to run some queries over hive tables with hawq.

From what i have seen Hawq can query hive tables through HCatalog (https://community.hortonworks.com/articles/43264/hawqhdb-and-hadoop-with-hive-and-hbase.html ), and so, i use psql tool on the comand line to run queries like this:

SELECT * FROM hcatalog.hive-db-name.hive-table-name;

Previously i run some queries on Hive to compare results with Hawq, i was expecting hawq to be much faster, but hawq its being much more slow, the query response is much more long than in Hive. The specfic query that i am trying to run is query 1 from TPCH on hive table stored as ORC. Hive took 18 seconds, running the query in psql tool with hcatalog 6 minutes and 28s.

Can someone explain why is this happening?

2 REPLIES 2

Re: Hawq Queries Very Slow

Contributor

What version of HAWQ/HDB is it? What PXF profile are you using? You should try the new HiveORC profile in HDB 2.2.0.0, if you haven't:

http://hdb.docs.pivotal.io/220/hawq/pxf/HivePXF.html#hiveorc-intro

Re: Hawq Queries Very Slow

New Contributor

Hi,hcatalog just use one segment to deal with the data.

I think you can use hawq Managed table or external table (pxf) to select your tables

Don't have an account?
Coming from Hortonworks? Activate your account here