Member since
05-20-2016
12
Posts
0
Kudos Received
0
Solutions
11-01-2016
07:49 AM
If there is no way to do this, having confirmation of that would also be helpful.
... View more
10-25-2016
12:03 PM
In HUE's Job Browser, I can see jobs, which for us is a bunch of different hive queries submitted by several users. How can I view the SQL for a given job_id? When I see a query that's been running for a long time, I want to know what it is.
... View more
Labels:
- Labels:
-
Apache Hive
-
Cloudera Hue
09-21-2016
01:56 PM
I saw in the link below that the character limit for Actions has been expanded from 50 to 128. https://issues.apache.org/jira/browse/OOZIE-2168 Does CDH 5.7 have this newer version of Oozie that allows 128 character names for Actions?
... View more
Labels:
- Labels:
-
Apache Oozie
09-02-2016
11:54 AM
Thanks! Created JIRA at: https://issues.apache.org/jira/browse/SQOOP-3005
... View more
08-25-2016
01:47 PM
From what I understand, for the lastmodified update method, Sqoop selects records where timestamp_column >= last modified timestamp and timestamp column < current_time. Is there a way to customize that current_time upper bound? Can I do something like current_time - 1 hour? We have transactions being created on one server, then replicated to another server, then Sqooped from there. I noticed some missing data in our cluster today, and suspect replication delay as the root cause.
... View more
Labels:
- Labels:
-
Apache Sqoop
05-21-2016
10:57 AM
Interesting, is the HPL/SQL feature of Hive 2.0 available in CDH 5.7?
... View more
05-21-2016
07:12 AM
Hmm, looks like the functionality I'm looking for is called Dynamic Partition Pruning? (in Impala 2.5 / CDH 5.7) https://dzone.com/articles/apache-impala-incubating-in-cdh-57-4x-faster-for-b
... View more
05-21-2016
06:17 AM
Is it possible to take advantage of partition pruning without specifying the value of the partition column as a constant in the query? Is there a way to get it from another joined small table? In this example, table wallet_month has 1 record with column month_of_year, that has value '2016-05'. Table transaction_detail is partitioned on month_of_year, and has 400 partitions. Looking at the Explain plan, below query still uses all 400 paritions from TD. SELECT count(*) FROM wallet_month WM JOIN transaction_detail TD AND TD.month_of_year = WM.month_of_year Adding a where clause (WHERE WM.month_of_year = '2016-05') gives the desired behavior of only using 1 partition from TD, but I'd like to know if there's another way.
... View more
05-20-2016
10:25 PM
Is dynamic SQL possible in Impala? or Hive? And by that I mean the ability to execute a string like a statement, like: exec ('select 1')
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Impala