Member since
06-07-2016
923
Posts
322
Kudos Received
115
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3352 | 10-18-2017 10:19 PM | |
3717 | 10-18-2017 09:51 PM | |
13434 | 09-21-2017 01:35 PM | |
1390 | 08-04-2017 02:00 PM | |
1851 | 07-31-2017 03:02 PM |
08-03-2016
04:42 PM
@Eon kitex Hortonworks does a lot of integration testing before it releases any new version of HDP. It means that Hortons makes sure that all components being part of HDP have been tested to work together so that they are supportable and enterprise ready. You cannot upgrade individual components. If you want to just test spark 2.0, you can download it separately on a different machine and evaluate it. But you cannot upgrade just spark HDP 2.3.2. When Hortonworks starts supporting Spark 2.0, it will be included in newer versions of HDP.
... View more
08-03-2016
04:38 PM
@Kumar Veerappan This is confusing. Not sure what's causing it. But to find the right spark version, just run your spark-shell. It shows spark version when it starts.
... View more
08-02-2016
10:53 PM
2 Kudos
@Vipin Rathor I figured this one out. Here is the thing. spark doesn't allow to submit keytab and principal with proxy-user. The reason is you need keytab and principal when you are running long running jobs. In this case, for long running jobs, the keytab will be copied to application master staging area and this keytab and principal will be used to renew delegation tokens required for HDFS. This enables the application to continue working with any security issue. Remember this feature is explicitly for long running applications. See details here under YARN mode, second paragraph. Imagine if all application users logging into my applications can proxy to my keytab user since the yarn application will be submitted as the proxy-user but it will copy the keytab. This means those users can read the contents of that keytab. This is a huge security flaw. And that's why what I was trying above is not allowed. I have to do what Hive does to run "spark-submit". Basically kinit before submitting my application and then provide a proxy-user. So here is how I solved it. kinit <principal>@<REALM> -k -t <mykeytab file> spark-submit <all my options include --proxy-user <my proxy user>> This way my proxy-user cannot read keytab contents and is only used as a proxy-user. My application is not long running (like more than 7 days which is the usual lifetime of a TGT), so I am fine.
... View more
08-02-2016
10:41 PM
@kavitha velaga I think they are both same and hive community may have changed to the newer "doAs". Try setting hive.server.enable.impersonation to false. I don't think it will make a difference. The property used is hive.server2.enable.doAs
... View more
08-02-2016
06:37 PM
@Saurabh Kumar You might be following a different set of instructions. The one i am looking at uses solr-yarn.jar and here is the format hadoop jar solr-yarn.jar org.apache.solr.cloud.yarn.SolrClient -nodes=2 -zkHost=localhost:2181 -solr=hdfs://localhost:9000/solr/solr.tgz -jar=hdfs://localhost:9000/solr/solr-yarn.jar -memory 512 -hdfs_home=hdfs://localhost:9000/solr/index_data This is where I think you should be able to add "-queue=<queue name>" did you try following the instructions on this link? the class name which supports "queue" is not in the command you are running. Is it possible for you to try the following? https://lucidworks.com/blog/2014/12/01/solr-yarn/
... View more
08-02-2016
04:20 PM
@kishore sanchina nofile is the number of open files a linux user can open in a session. Check this link for more details. There is a soft and hard limit for this. nproc is the number of processes a linux user may open in a session. This link may also help you understand how this works.
... View more
08-02-2016
03:52 PM
@Saurabh Kumar I think you can use an option "queue" to specify the queue. Please see line 133 of SolrClient.java here. https://github.com/lucidworks/yarn-proto/blob/master/src/main/java/org/apache/solr/cloud/yarn/SolrClient.java Also, follow this instruction if you run into any issue. https://lucidworks.com/blog/2014/12/01/solr-yarn/
... View more
08-01-2016
09:10 PM
@Raja A I will try this on my machine when I get a chance, hopefully later today but I think it should "input.regex"="^\\^\\^" The first one signifies the beginning of a string and other two are for your matching pattern.
... View more
08-01-2016
07:12 PM
Hi @Raja A In both cases the second one has only single slash. Can you try it with two slashes. I think the first one should work if you just add two slashes. Also, why not try to make it work with only "^^" first. Figure out how to make ^^ work and then you can simply extend that to ^^^^^^^^^^.
... View more
08-01-2016
05:43 PM
@Raja A I will try \\^ for each character. so basically \\^\\^\\^\\^\\^\\^\\^\\^\\^\\^ Can you try this?
... View more