Member since: 09-17-2015
Posts: 103
Kudos Received: 61
Solutions: 18
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2271 | 06-15-2017 11:58 AM |
| | 2052 | 06-15-2017 09:18 AM |
| | 1961 | 06-09-2017 10:45 AM |
| | 1391 | 06-07-2017 03:52 PM |
| | 3024 | 01-06-2017 09:41 PM |
09-14-2016
12:46 PM
If you lose 2 ZK nodes, the NN will stay up, but if it then goes down, failover won't occur. If you lose 2 JNs, your NNs will go down.
09-14-2016
12:35 PM
That's correct. Indeed, if you have only one JN left, you'll get into trouble, with NameNodes going down. The same goes for ZooKeeper, which must have a quorum. From the ZooKeeper docs: "Three ZooKeeper servers is the minimum recommended size for an ensemble".
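The majority rule behind these answers (ZooKeeper and the JournalNodes each need a strict majority of their members alive) can be sketched in a few lines of Python. The function names are made up for illustration; this is only the arithmetic, not anything taken from Hadoop or ZooKeeper code.

```python
def quorum_size(ensemble_size: int) -> int:
    """Smallest strict majority of an ensemble of the given size."""
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size: int) -> int:
    """How many members can be lost while a majority is still alive."""
    return ensemble_size - quorum_size(ensemble_size)


def has_quorum(ensemble_size: int, alive: int) -> bool:
    """True if the surviving members still form a strict majority."""
    return alive >= quorum_size(ensemble_size)


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(f"{n} nodes: quorum={quorum_size(n)}, "
              f"tolerates {tolerated_failures(n)} failure(s)")
    # With 3 JNs or 3 ZK servers, a single survivor is not a majority.
    print(has_quorum(3, 1))  # False
```

Note that 4 nodes tolerate no more failures than 3, which is why odd ensemble sizes are the usual choice.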
09-12-2016
03:45 PM
3 Kudos
Hi Tim, rack awareness is about how blocks are replicated across the cluster, so setting it protects you from losing data (and improves locality). It's not related to how the masters are configured, but by placing them accordingly you can make sure your cluster won't go down.
Note that you can install more than 3 ZK or JournalNodes (this has pros and cons, though). There's a nice two-part article on HCC: https://community.hortonworks.com/articles/43057/rack-awareness-1.html
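For reference, on a cluster configured by hand, rack awareness is typically driven by a topology script referenced from `net.topology.script.file.name` in core-site.xml: Hadoop calls it with one or more host names or IPs and reads back one rack path per argument. Below is a minimal sketch of such a script in Python. The IP addresses and rack names are hypothetical, and none of this comes from the thread itself.

```python
#!/usr/bin/env python3
"""Minimal topology script sketch: prints one rack path per host argument."""
import sys

# Hypothetical host-to-rack mapping; replace with your own inventory.
RACKS = {
    "10.0.1.11": "/rack1",
    "10.0.1.12": "/rack1",
    "10.0.2.11": "/rack2",
}

# Hosts missing from the mapping fall back to Hadoop's conventional default rack.
DEFAULT_RACK = "/default-rack"


def resolve(hosts):
    """Return the rack path for each host, in the same order as the input."""
    return [RACKS.get(host, DEFAULT_RACK) for host in hosts]


if __name__ == "__main__":
    # Hadoop passes the hosts as command-line arguments.
    print(" ".join(resolve(sys.argv[1:])))
```

A static dictionary keeps the sketch self-contained; in practice the mapping would more likely be read from a file that is maintained alongside the host inventory.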
09-08-2016
07:02 AM
thanks @Junping Du
09-07-2016
03:51 PM
5 Kudos
And what's the exact purpose of this? I understood that the default value (10) is very low and that we should set it to the number of nodes in the cluster. What would be the impact of setting it to, say, 1000 or even 100000? Thanks, experts!
Labels: Apache Hadoop, Apache YARN
08-17-2016
10:05 AM
Tez, Pig, Sqoop, etc. are clients, so they won't show as "green" like HDFS or YARN.
07-27-2016
04:07 PM
1 Kudo
@bigdata.neophyte Kerberizing the sandbox is not as tough as you may think. You can find some guidelines here, or do it even more simply with FreeIPA: here or here
07-18-2016
03:50 PM
1 Kudo
@Sunile Manjee HA only concerns the Oozie server, which is stateless (it relies on its backend database). Failover is therefore only visible from a job-submission perspective, so there's no impact on currently running jobs.
07-14-2016
07:46 AM
1 Kudo
You're apparently running out of file descriptors; try adjusting your ulimit accordingly.
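To see where you stand before changing anything, here is a small Python sketch that prints the open-file limits of the current process and, on Linux, how many descriptors it has open. It inspects the Python process itself, not the failing daemon; for a running service you would look at that process's own limits (on Linux, `/proc/<pid>/limits`). The helper names are made up for illustration.

```python
import os
import resource


def nofile_limits():
    """Return the (soft, hard) open-file limits of the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def open_fd_count():
    """Count descriptors open in this process, or None if /proc is unavailable."""
    fd_dir = "/proc/self/fd"  # Linux-specific
    if not os.path.isdir(fd_dir):
        return None
    return len(os.listdir(fd_dir))


def fmt(limit):
    """Render a limit value, showing RLIM_INFINITY as 'unlimited'."""
    return "unlimited" if limit == resource.RLIM_INFINITY else str(limit)


if __name__ == "__main__":
    soft, hard = nofile_limits()
    print(f"soft={fmt(soft)} hard={fmt(hard)} in_use={open_fd_count()}")
```

The soft limit is the one a process actually hits; it can be raised up to the hard limit without extra privileges.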
07-11-2016
08:08 AM
1 Kudo
@Abhishek Srivastava You can find hive-site.xml in /etc/hive/conf/, but if you're using Ambari to manage your cluster, make the change in Ambari so that it gets deployed to every host.
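If you only want to check what a host currently has in its hive-site.xml, a small read-only sketch like the one below is enough. It assumes the usual Hadoop-style layout (`<configuration>` containing `<property>` elements with `<name>` and `<value>`); the sample document and its value are purely illustrative, and the function name is made up.

```python
import xml.etree.ElementTree as ET


def read_hadoop_conf(xml_text):
    """Parse a Hadoop-style configuration document into a name -> value dict."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


# Illustrative sample only; a real file would be read with, for example:
#   with open("/etc/hive/conf/hive-site.xml") as f:
#       props = read_hadoop_conf(f.read())
SAMPLE = """<configuration>
  <property>
    <name>hive.execution.engine</name>
    <value>tez</value>
  </property>
</configuration>"""

if __name__ == "__main__":
    print(read_hadoop_conf(SAMPLE)["hive.execution.engine"])  # prints: tez
```

It deliberately does not write anything back: on an Ambari-managed cluster, manual edits to the file would be overwritten by the next configuration push.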