Member since
05-30-2018
1322
Posts
715
Kudos Received
148
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 4025 | 08-20-2018 08:26 PM | |
| 1930 | 08-15-2018 01:59 PM | |
| 2361 | 08-13-2018 02:20 PM | |
| 4077 | 07-23-2018 04:37 PM | |
| 4993 | 07-19-2018 12:52 PM |
04-12-2016
01:33 PM
@michael perez To add to Emil, to isolate CPU and Ram resources enable CPU scheduling. More info here. CPU scheduling represents one aspect of YARN resource management capabilities that includes CGroups, node labels, archival storage, and memory as storage. CGroups should be used with CPU scheduling to constrain and manage CPU processes.
... View more
04-12-2016
08:15 AM
1 Kudo
@Divya Gehlot Couple suggestions HBase is not performant for scans as it is a db for random reads/writes. If scans are to be performs do it on the key and not the columns.
... View more
04-12-2016
08:10 AM
Does Cloudbreak support launching instances on vSphere? I get the feeling it might be through SPI but I don't see any working examples or references where anyone got this up and running.
... View more
Labels:
- Labels:
-
Hortonworks Cloudbreak
04-10-2016
03:06 AM
disable iptables http://www.cyberciti.biz/faq/turn-on-turn-off-firewall-in-linux/ disable selinux http://www.crypt.gen.nz/selinux/disable_selinux.html
... View more
04-08-2016
09:53 PM
I am interested in pruning entries manually from the name node. If orphan entries exist in namenode (where namenode thinks file exist when it does not) how do I manually remove from namenode metadatea?
... View more
Labels:
- Labels:
-
Apache Hadoop
04-07-2016
02:30 AM
@Michael Rife On ambari home page what does it show in terms of resource usage on your sandbox? Are their any orphan jobs running?
... View more
04-06-2016
09:08 PM
straight out of the docs:
SET pig.splitCombination false;
A = LOAD 'hawq://mdw1:5432/gpadmin' USING com.gopivotal.pig.HawqLoader('retail_demo.products_dim', 'gpadmin');
DESCRIBE A;
A: {product_id: int,category_id: int,price: bigdecimal,product_name: chararray}
B = LOAD 'hawq://mdw1:5432/gpadmin' USING com.gopivotal.pig.HawqLoader('retail_demo.vinyl_counts', 'gpadmin');
DESCRIBE B;
B: {category_id: int,count: int}
C = JOIN A BY category_id, B by category_id USING 'replicated';
STORE C INTO '$output';
... View more
04-06-2016
06:14 PM
@Amit SharmaDid this resolve your issue?
... View more
04-05-2016
06:21 PM
@also take a look here about spark security. SASL encryption is currently supported for the block transfer service when authentication ( spark.authenticate ) is enabled. To enable SASL encryption for an application, set spark.authenticate.enableSaslEncryption to true in the application’s configuration. When using an external shuffle service, it’s possible to disable unencrypted connections by setting spark.network.sasl.serverAlwaysEncrypt to true in the shuffle service’s configuration. If that option is enabled, applications that are not set up to use SASL encryption will fail to connect to the shuffle service.
... View more
04-05-2016
06:18 PM
Per Known issues, bug BUG-39674 - Spark does not yet support wire encryption, dynamic executor allocation, SparkR, GraphX, Spark Streaming, iPython, or Zeppelin. link here. also similar question was asked here.
... View more