About gopalv

kdmandawe · ‎02-22-2019

I was scratching my head a lot too, trying to find out what a "blessed" UDF is supposed to mean. Luckily, I was able to make UDFs work on LLAP.

Some findings: Even if you can create UDFs using the Hive shell (non-LLAP), invoking those functions in LLAP mode will not work. You can invoke them in non-LLAP sessions only. Executing `create function` scripts over an LLAP (JDBC) connection and then invoking the functions immediately will not work either. They won't even show up when you run `show functions;`.

Below are the steps I followed to make it work on LLAP:

1. Write the drop and create function statements in a JDBC application and execute them over a Hive LLAP JDBC connection. After running this Java application, you will not see your functions yet (if you check in the Hive shell using `show functions;`).
2. Restart HiveServer2 Interactive and HiveServer2 in Ambari.
3. After a successful restart, connect to the Hive shell and run `show functions;`. Voila! Your UDFs now appear.
4. After that, you should be able to invoke your UDFs in LLAP mode.

The above steps worked for me on HDP 3.

HTH,
Kenneth
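As a rough illustration of step 1, here is a minimal sketch that only builds the drop and create statements; it does not open a connection. The database, function name, class name, and JAR location are all hypothetical placeholders, and the statements would still need to be sent over whatever LLAP JDBC client you use, followed by the restart described above.

```python
def udf_statements(database, function, class_name, jar_uri):
    """Build the DROP and CREATE statements for one permanent Hive UDF.

    All four arguments are placeholders chosen by the caller; nothing
    here is taken from a real cluster.
    """
    qualified = f"{database}.{function}"
    return [
        f"DROP FUNCTION IF EXISTS {qualified}",
        f"CREATE FUNCTION {qualified} AS '{class_name}' USING JAR '{jar_uri}'",
    ]


# Hypothetical example values, for illustration only.
statements = udf_statements(
    "default",
    "my_upper",
    "com.example.udf.MyUpper",
    "hdfs:///apps/udfs/my-udfs.jar",
)

for statement in statements:
    print(statement)
```

Dropping before creating keeps the script re-runnable when the same function is registered more than once.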

shashant_panwar · ‎01-27-2017

Also, how do I use this new HA URL to connect from Tableau?

TimothySpann · ‎10-06-2016

thanks, that worked

gopalv · ‎07-20-2016

It looks like your datanodes are dying from too many open files. Check the nofile setting for the "hdfs" user in /etc/security/limits.d/. If you want to bypass that particular problem by changing the query plan, try `set hive.optimize.sort.dynamic.partition=true;`
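To see what open-file limit a process actually gets, a small sketch like the following can help. It reports the limit for whichever user runs it, so to learn anything about the datanodes it would have to be run as the "hdfs" user on a datanode host; that, and the Unix-only `resource` module, are assumptions of this example.

```python
import resource


def describe(limit):
    """Render an rlimit value, showing the 'no limit' sentinel as text."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


# Soft limit is what the process is held to; hard limit is the ceiling
# the soft limit may be raised to without extra privileges.
soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)

print(f"open-file limit: soft={describe(soft)} hard={describe(hard)}")
```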

mph · ‎07-27-2016

The cluster is fairly small as it's mostly experimental, but 3 of the 4 nodes each have 4 vCores and 1GB of memory, with a global YARN minimum container memory size of 256MB. So when you say slots, I'm assuming that would potentially translate into 12 slots/containers, i.e. each container representing 1 vCore + 256MB? I had assumed that the resources (CPU/RAM) available in my cluster would be more than enough for the query I'm running on the dataset sizes I'm working with, i.e. 30-40k records?
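The slot estimate above can be written out as a quick calculation. This is only a sketch under two assumptions that are not confirmed in the thread: that all of each node's 1GB is available to YARN, and that every container is granted exactly the 256MB minimum. If containers request more than the minimum, the memory-bound figure shrinks.

```python
# Figures quoted in the post above.
nodes = 3
vcores_per_node = 4
memory_mb_per_node = 1024
min_container_mb = 256

# One container per vCore across the worker nodes.
slots_by_cpu = nodes * vcores_per_node

# How many minimum-size containers fit in each node's memory.
slots_by_memory = nodes * (memory_mb_per_node // min_container_mb)

# The tighter of the two bounds is the usable number of slots.
total_slots = min(slots_by_cpu, slots_by_memory)

print(slots_by_cpu, slots_by_memory, total_slots)
```

With these numbers both bounds come out at 12, so CPU and memory limit the cluster equally.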

gopalv · ‎06-04-2016

ORC is considering adding a faster decompression codec in 2016: zstd (ZStandard). The enum values for it have already been reserved, but it has to wait until we work through the trade-offs involved in ZStd; more on that sometime later this year. https://issues.apache.org/jira/browse/ORC-46

But bigger wins are in motion for ORC with LLAP: the in-memory format for LLAP isn't compressed at all, so it performs like ORC without compression overheads, while letting the cold data on disk sit around in Zlib.

nsabharwal · ‎10-21-2015

@cliu@hortonworks.com These are very helpful benchmarks posted by AMPLab.

Last Visited	‎05-11-2020 09:14 PM

Member Since	‎09-28-2015 08:23 PM
Posts	41
Kudos received	44

Cloudera Community

Re: How are UDF's treated with Hive LLAP?

Re: Hive LLAP CLI Usage

Re: What happens when a hive partition is queried ...

Re: how to show a query is using LLAP

Re: org.apache.hadoop.hive.ql.metadata.HiveExcepti...

Re: Error Configuring Hive LLAP

Re: Hive query running on Tez contains a Mapper th...

Re: Snappy vs. Zlib - Pros and Cons for each compr...

Re: Can you please advise about how best to use th...