Member since
07-31-2013
1924
Posts
462
Kudos Received
311
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1634 | 07-09-2019 12:53 AM | |
10114 | 06-23-2019 08:37 PM | |
8307 | 06-18-2019 11:28 PM | |
9056 | 05-23-2019 08:46 PM | |
3679 | 05-20-2019 01:14 AM |
05-04-2014
12:18 AM
You could do it via some of the date based built-in functions at http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/ciiu_functions.html#datetime_functions_unique_1 I'd imagine something such as: unix_timestamp(to_date(event_end_timestamp)) - unix_timestamp(to_date(event_start_timetsamp)) Should give you an bigint return of the number of milliseconds.
... View more
05-03-2014
11:56 PM
Thanks for following up and reporting this!
... View more
03-17-2014
11:17 AM
Thank you Harsh !!!
... View more
03-07-2014
06:16 PM
Wherever possible, it is always good to combine multiple goals into the same job to avoid extra I/O and additional job bootup/cleanup overheads. Optimisers in Hive and Pig try to do this as well. The reducer can emit different outputs via the MultipleOutputs classes. For map outputs, you'll need to build some form of "general" class and hierarchy that can cater to different styles of output and partition them appropriately. Its definitely possible, but needs some additional work.
... View more
02-26-2014
10:17 AM
Thank you Harsh !!!
... View more
02-14-2014
07:37 AM
Thank you. Shame on me for not "finishing the job" as it were.
... View more
02-05-2014
01:33 AM
Thnx for the hint, to be complete: Sqooped data to Hive uses '\u0001' as a field delimiter. LzoTextLoader does not support adding a custom delimiter, use LzoTokenizedLoader for that. Works like a charm after that
... View more