Reply
Cloudera Employee
Posts: 39
Registered: ‎12-14-2016

Re: Exercise 2 : Long time running + error from intermediate_access_logs table creation query

Absolutely! You could also mark the solution for anyone else that comes across this issue in the future.

 

Cheers

New Contributor
Posts: 2
Registered: ‎01-29-2018

Re: Exercise 2 : Long time running + error from intermediate_access_logs table creation query

I added the jar file and still my query is running for the last 8 minutes.

CREATE EXTERNAL TABLE intermediate_access_logs (
ip STRING,
date STRING,
method STRING,
url STRING,
http_version STRING,
code1 STRING,
code2 STRING,
dash STRING,
user_agent STRING)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
'input.regex' = '([^ ]*) - - \\[([^\\]]*)\\] "([^\ ]*) ([^\ ]*) ([^\ ]*)" (\\d*) (\\d*) "([^"]*)" "([^"]*)"',
'output.format.string' = "%1$$s %2$$s %3$$s %4$$s %5$$s %6$$s %7$$s %8$$s %9$$s")
LOCATION '/user/hive/warehouse/original_access_logs';

Any other setting change needs to be done?
Announcements