Member since 10-01-2015 · 3933 Posts · 1150 Kudos Received · 374 Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 3365 | 05-03-2017 05:13 PM
 | 2796 | 05-02-2017 08:38 AM
 | 3076 | 05-02-2017 08:13 AM
 | 3006 | 04-10-2017 10:51 PM
 | 1517 | 03-28-2017 02:27 AM
01-06-2016
01:41 PM
1 Kudo
@Raja Ray are all the standard requirements set, e.g. ulimit and swappiness? Also, can you check the disk health? And what OS are you running? If it's RPM-based, do you have Transparent Huge Pages turned off?
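For reference, these prerequisites can be checked quickly on a Linux host. This is a minimal sketch assuming the usual Linux paths; the recommended values noted in the comments are the common Hadoop guidance, so verify them against the docs for your version:

```shell
# Open-file limit (Hadoop guides typically recommend 10000 or higher)
echo "open files limit: $(ulimit -n)"
# Swappiness (a low value such as 1 is usually recommended on Hadoop nodes)
echo "swappiness: $(cat /proc/sys/vm/swappiness 2>/dev/null || echo unknown)"
# Transparent Huge Pages (should report [never] on RPM-based systems)
echo "THP: $(cat /sys/kernel/mm/transparent_hugepage/enabled 2>/dev/null || echo unknown)"
```

Disk health can be checked separately with the vendor's tooling (e.g. smartmontools) on each data disk.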
01-06-2016
01:36 PM
I think this calls for a JIRA with Ambari for the stack advisor?
01-06-2016
02:23 AM
2 Kudos
@Sebastian Hans it's a known bug with Mac OS X 10.11. Please wait for the next version; more details here: https://community.hortonworks.com/articles/3364/hi...
01-06-2016
12:40 AM
Glad it's not an abandoned feature. Are there more examples and/or docs available? I created a few of my own, but I think we need better examples. Thank you @gopal
01-05-2016
09:21 PM
If you use the tarball, then Windows 7 with 4GB of RAM is fine. Yes, you'd need to download the Apache distribution for that. If you use Ambari, it will automatically pull the Hortonworks distribution, and you can pick and choose what to install, thereby overcoming your memory limit.
01-05-2016
09:01 PM
Great, please try your Sqoop query with the --direct flag; it should generally improve your performance. Other than that, please accept one of the answers to close the issue.
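As an illustration of where --direct fits, here is a sketch of a typical Sqoop import; the JDBC URL, table name, and target directory are placeholders, not details from this thread. --direct delegates the transfer to the database's native bulk tooling (e.g. mysqldump) where the connector supports it:

```shell
# Placeholder connection details; substitute your own.
# --direct enables the native bulk-transfer path when the connector supports it.
sqoop import \
  --connect jdbc:mysql://dbhost/sales \
  --table orders \
  --target-dir /user/guest/orders \
  --direct
```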
01-05-2016
08:25 PM
3 Kudos
Groovy UDF example. Groovy UDFs can be compiled at run time. Currently this only works in the "hive" shell; it does not work in beeline.

```
su guest
hive
```

Paste the following code into the hive shell. This will use the Groovy String replace function to replace all instances of lower case 'e' with 'E':

```
compile `import org.apache.hadoop.hive.ql.exec.UDF \;
import org.apache.hadoop.io.Text \;
public class Replace extends UDF {
  public Text evaluate(Text s){
    if (s == null) return null \;
    return new Text(s.toString().replace('e', 'E')) \;
  }
} ` AS GROOVY NAMED Replace.groovy;
```

Now create a temporary function to leverage the Groovy UDF:

```
CREATE TEMPORARY FUNCTION Replace as 'Replace';
```

Now you can use the function in your SQL:

```
SELECT Replace(description) FROM sample_08 limit 5;
```
Full example:

```
hive> compile `import org.apache.hadoop.hive.ql.exec.UDF \;
    > import org.apache.hadoop.io.Text \;
    > public class Replace extends UDF {
    > public Text evaluate(Text s){
    > if (s == null) return null \;
    > return new Text(s.toString().replace('e', 'E')) \;
    > }
    > } ` AS GROOVY NAMED Replace.groovy;
Added [/tmp/0_1452022176763.jar] to class path
Added resources: [/tmp/0_1452022176763.jar]
hive> CREATE TEMPORARY FUNCTION Replace as 'Replace';
OK
Time taken: 1.201 seconds
hive> SELECT Replace(description) FROM sample_08 limit 5;
OK
All Occupations
ManagEmEnt occupations
ChiEf ExEcutivEs
GEnEral and opErations managErs
LEgislators
Time taken: 6.373 seconds, Fetched: 5 row(s)
hive>
```
Another example: this will duplicate any String passed to the function.

```
compile `import org.apache.hadoop.hive.ql.exec.UDF \;
import org.apache.hadoop.io.Text \;
public class Duplicate extends UDF {
  public Text evaluate(Text s){
    if (s == null) return null \;
    return new Text(s.toString() * 2) \;
  }
} ` AS GROOVY NAMED Duplicate.groovy;
CREATE TEMPORARY FUNCTION Duplicate as 'Duplicate';
SELECT Duplicate(description) FROM sample_08 limit 5;
```

Output:

```
All OccupationsAll Occupations
Management occupationsManagement occupations
Chief executivesChief executives
General and operations managersGeneral and operations managers
LegislatorsLegislators
```
JSON parsing UDF:

```
compile `import org.apache.hadoop.hive.ql.exec.UDF \;
import groovy.json.JsonSlurper \;
import org.apache.hadoop.io.Text \;
public class JsonExtract extends UDF {
  public int evaluate(Text a){
    def jsonSlurper = new JsonSlurper() \;
    def obj = jsonSlurper.parseText(a.toString()) \;
    return obj.val1 \;
  }
} ` AS GROOVY NAMED json_extract.groovy;
CREATE TEMPORARY FUNCTION json_extract as 'JsonExtract';
SELECT json_extract('{"val1": 2}') from date_dim limit 1;
```

Output:

```
2
```
01-05-2016
08:07 PM
I am talking about a separate install, not the Sandbox. The Sandbox requires 8GB of RAM. You said you want to learn; one way to learn is to install from scratch. The Sandbox is great for tutorials and some admin work, but to really learn the ins and outs, you need to try it out on a vanilla OS box.
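For anyone wanting to try that from-scratch route, the rough shape of an Ambari-based install on a CentOS/RHEL box looks like the sketch below. The repo URL and version are illustrative; take the exact URL for your version from the Hortonworks install guide:

```shell
# Illustrative repo URL and version; check the install guide for yours.
sudo wget -nv http://public-repo-1.hortonworks.com/ambari/centos7/2.x/updates/2.2.0.0/ambari.repo \
     -O /etc/yum.repos.d/ambari.repo
sudo yum install -y ambari-server
sudo ambari-server setup -s   # -s runs setup silently with defaults
sudo ambari-server start
# Then open http://<host>:8080 and use the cluster install wizard
# to pick and choose which services to deploy.
```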
01-05-2016
07:37 PM
With 4GB of memory, the only way you can learn Hadoop is to either install Apache Hadoop on your host machine or run a VM with 2GB of RAM and install Ambari. After installing Ambari, you can install the rest of the stack. This is a great learning experience; it's how I got started with Hadoop and Hortonworks.
01-05-2016
05:27 PM
Ambari will restart everything that has stale configs. To get the best of both worlds (restarting stale configs while keeping the cluster up), go through each host and restart the components with stale configs per node, rather than per cluster as you were doing.
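If you prefer to script that per-host pass, Ambari's REST API can issue the same restart. This is a sketch, not a definitive recipe: the server address, cluster name, host name, service/component, and admin credentials below are all placeholder assumptions you would substitute for your environment:

```shell
# Placeholders throughout: ambari-host, c1, host1.example.com, admin:admin,
# and the HDFS/DATANODE filter. Repeat per component/host that Ambari
# flags as having stale configs.
curl -u admin:admin -H 'X-Requested-By: ambari' -X POST \
  -d '{
    "RequestInfo": {
      "command": "RESTART",
      "context": "Restart stale-config components on host1",
      "operation_level": {"level": "HOST", "cluster_name": "c1"}
    },
    "Requests/resource_filters": [
      {"service_name": "HDFS", "component_name": "DATANODE", "hosts": "host1.example.com"}
    ]
  }' \
  http://ambari-host:8080/api/v1/clusters/c1/requests
```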