About rmian

smartninja723 · ‎03-02-2016

I wanted to know a couple of things here. 1) Suppose I have a few MapReduce jobs that need to run on HDI. As I understand it, the HDI approach is build, run, and delete. If I place all my jars, Oozie jobs, and configurations on the cluster and then delete the cluster today, do I need to copy all the jars and reconfigure the Oozie jobs when I want to run the same batch job in the future? 2) Is it possible to configure Solr to run on HDInsight?

cgross · ‎01-27-2016

Some of the pages require tunneling. See "Manage HDInsight clusters by using the Ambari Web UI": https://azure.microsoft.com/en-us/documentation/articles/hdinsight-hadoop-manage-ambari/

cgross · ‎01-27-2016

Make sure you use SSL via HTTPS. See https://azure.microsoft.com/en-us/documentation/articles/hdinsight-hadoop-manage-ambari/
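To illustrate the HTTPS point, here is a minimal Python sketch that refuses any Ambari URL not using the https scheme. The helper name require_https and the cluster name mycluster are hypothetical, chosen only for illustration.

```python
from urllib.parse import urlsplit


def require_https(url: str) -> str:
    """Return the URL unchanged if it uses HTTPS, otherwise raise ValueError."""
    scheme = urlsplit(url).scheme.lower()
    if scheme != "https":
        raise ValueError(f"expected an https URL, got scheme {scheme!r}")
    return url


if __name__ == "__main__":
    # "mycluster" is a placeholder; substitute your own cluster name.
    print(require_https("https://mycluster.azurehdinsight.net/"))
```

A check like this only guards against typing http by mistake; it does not replace the SSH tunnel that some of the Ambari pages need.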

cgross · ‎01-27-2016

Give Azure Data Factory (ADF) a try.

pbrahmbhatt · ‎11-06-2015

There is no way to kerberize Kafka in 2.2. If you are running storm-2.2, you should still be able to run a topology that uses the storm-kafka connector from version 2.3, which should be able to read from a secure Kafka cluster (HDP or non-HDP).

sluangsay · ‎10-28-2015

If you use the same partitions for YARN intermediate data as for the HDFS blocks, then you might also consider setting the dfs.datanode.du.reserved property, which reserves some space on those partitions for non-HDFS use (such as intermediate YARN data). One base recommendation I saw in my first Hadoop training a long time ago was to dedicate 25% of the "data disks" to that kind of intermediate data. I guess the optimal answer should consider the maximum amount of intermediate data you can have at any one time (when launching a job, do you use all the data in HDFS as input?) and dedicate the space for yarn.nodemanager.resource.local-dirs accordingly. I would also recommend turning on the mapreduce.map.output.compress property in order to reduce the size of the intermediate data.
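As a rough worked example of the 25% rule of thumb, the Python sketch below computes a per-volume reserved size and pairs it with the compression setting. It assumes the HDFS property is the one spelled dfs.datanode.du.reserved in hdfs-site.xml, whose value is a byte count per volume; the 4096 GiB disk size and the helper names are hypothetical, and the right fraction depends on your own workload.

```python
GIB = 1024 ** 3  # bytes in one GiB


def reserved_bytes(disk_size_gib: float, fraction: float = 0.25) -> int:
    """Bytes to leave free on one data volume for non-HDFS use."""
    if not 0 <= fraction < 1:
        raise ValueError("fraction must be in [0, 1)")
    return int(disk_size_gib * GIB * fraction)


def suggested_settings(disk_size_gib: float, fraction: float = 0.25) -> dict:
    """Map property names to string values for hdfs-site.xml / mapred-site.xml."""
    return {
        # Per-volume reservation, expressed in bytes.
        "dfs.datanode.du.reserved": str(reserved_bytes(disk_size_gib, fraction)),
        # Compress map output to shrink intermediate data.
        "mapreduce.map.output.compress": "true",
    }


if __name__ == "__main__":
    # Assumed example: 4096 GiB data disks, 25% kept for intermediate data.
    for name, value in suggested_settings(4096).items():
        print(f"{name} = {value}")
```

The printed values would still need to be entered through Ambari or the relevant site files; the sketch only does the arithmetic.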


Member Since	‎09-28-2015 12:37 PM
Last Visited	‎01-18-2016 10:20 PM
Posts	14
Kudos received	2

Cloudera Community

Re: Sqooping data from LOCAL MSSQL Server to HDIns...

Re: Using HDInsight as a production Hadoop cluster...

Re: How to view Resource Manager UI in HDInsight o...

Re: Failing to login Ambari in HDInsight over Azur...

Re: Sqooping data from LOCAL MSSQL Server to HDIns...

Re: Can an external kerberized kafka cluster (non-...

Re: Recommended size for yarn.nodemanager.resource...