Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant.

Tutorial Exercise 2: select * from intermediate_access_logs not working as expected, why?

Contributor

In the Hive query editor, the query returns "The operation has no results."

In Impala, it shows:

Your query has the following error(s):

AnalysisException: Failed to load metadata for table: 'intermediate_access_logs' CAUSED BY: TableLoadingException: Failed to load metadata for table: intermediate_access_logs CAUSED BY: InvalidStorageDescriptorException: Impala does not support tables of this type. REASON: SerDe library 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' is not supported.

 

Thanks.

1 ACCEPTED SOLUTION

Contributor

I repeated the process and it works now. The last script still shows no result, though, so I will open a separate post for that.


3 REPLIES

Contributor

I double-checked and confirmed that the log exists in HDFS:

 

[cloudera@quickstart examples]$ hadoop fs -ls /user/hive/warehouse/original_access_logs
Found 1 items
-rw-r--r-- 1 hdfs supergroup 39593868 2016-05-28 06:23 /user/hive/warehouse/original_access_logs/access.log.2

 

 

Contributor

I repeated the process and it works now. The last script still shows no result, though, so I will open a separate post for that.

Guru
intermediate_access_logs was created as part of the ETL process in the
tutorial. That process runs in Hive because it uses Hive SerDes and
other Hive-only features, which is why Impala rejects the table with the
RegexSerDe error. The final table created in that process
(tokenized_access_logs, if I remember correctly) is the one you should be
able to query in Impala. Also, don't forget to run 'invalidate metadata'
in Impala when the ETL process is finished: Impala caches metadata, so it
won't see tables created through Hive until that cache is reloaded.
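
The advice above can be sketched as a short shell session on the quickstart VM. This is only a sketch under a few assumptions: that impala-shell is on the PATH and connects to the local Impala daemon with its defaults, and that the final table really is named tokenized_access_logs, as recalled in the reply. The run_impala helper is a hypothetical wrapper introduced here for illustration; it prints the statement instead of running it when impala-shell is not installed.

```shell
#!/bin/sh
# run_impala is a hypothetical helper, not part of the tutorial.
# It runs one statement through impala-shell, or prints it if impala-shell is missing.
run_impala() {
    if command -v impala-shell >/dev/null 2>&1; then
        impala-shell -q "$1"
    else
        printf 'would run in Impala: %s\n' "$1"
    fi
}

# Reload Impala's cached catalog so tables created through Hive become visible.
run_impala 'INVALIDATE METADATA'

# Query the final ETL table (name assumed from the reply above), not the
# RegexSerDe-backed intermediate_access_logs table that Impala cannot read.
run_impala 'SELECT * FROM tokenized_access_logs LIMIT 10'
```

The same two statements can also be pasted directly into the Impala query editor. intermediate_access_logs remains queryable from Hive only.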