Member since 09-17-2015

103 Posts, 61 Kudos Received, 18 Solutions

        My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2931 | 06-15-2017 11:58 AM |
| | 3047 | 06-15-2017 09:18 AM |
| | 3525 | 06-09-2017 10:45 AM |
| | 1817 | 06-07-2017 03:52 PM |
| | 4280 | 01-06-2017 09:41 PM |
			
    
	
		
		
09-14-2016 12:46 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
If you lose 2 ZK servers, the NN will stay up, but if it goes down, failover won't occur. If you lose 2 JNs, your NNs will go down.
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-14-2016 12:35 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
That's correct. Indeed, if you have only one JN left, you'll run into trouble with the NameNodes going down. The same goes for ZooKeeper, which must have a quorum. From the ZooKeeper documentation: "Three ZooKeeper servers is the minimum recommended size for an ensemble."
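
To make the quorum arithmetic concrete, here is a minimal sketch, assuming a simple strict-majority rule for both the ZooKeeper ensemble and the JournalNodes. The function names are mine, purely for illustration, and are not part of any Hadoop or ZooKeeper API.

```python
def max_tolerated_failures(ensemble_size: int) -> int:
    """Number of nodes that can fail while a strict majority stays alive."""
    return (ensemble_size - 1) // 2


def has_quorum(ensemble_size: int, failed: int) -> bool:
    """True if the surviving nodes still form a strict majority."""
    alive = ensemble_size - failed
    return alive > ensemble_size // 2


# Compare a 3-node and a 5-node ensemble.
for size in (3, 5):
    print(size, "nodes tolerate", max_tolerated_failures(size), "failure(s)")
```

With 3 nodes, one failure is tolerated, and losing a second one breaks the quorum, which matches the behaviour described above. Going to 5 nodes raises the tolerance to two failures.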
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-12-2016 03:45 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		3 Kudos
		
	
				
		
	
		
					
Hi Tim,

Rack awareness is related to how blocks are replicated across a cluster, so by setting it you won't lose any data (and you'll improve locality). It's not related to how masters are configured, but by placing them accordingly you can ensure that your cluster won't go down.

Note that you can install more than 3 ZK or journal nodes (this has pros and cons, though). There's a nice 2-part article on HCC: https://community.hortonworks.com/articles/43057/rack-awareness-1.html
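
For illustration, rack awareness is usually enabled by pointing `net.topology.script.file.name` (in core-site.xml) at a script that receives one or more hosts or IPs as arguments and prints one rack path per argument. Below is a minimal sketch of such a script. The IP addresses, rack names and the `resolve_racks` helper are hypothetical and should be replaced with your own inventory.

```python
#!/usr/bin/env python3
import sys

# Hypothetical host-to-rack mapping; replace with your own inventory.
RACK_BY_HOST = {
    "10.0.1.11": "/rack1",
    "10.0.1.12": "/rack1",
    "10.0.2.11": "/rack2",
}
# Fallback rack for hosts that are not in the mapping.
DEFAULT_RACK = "/default-rack"


def resolve_racks(hosts):
    """Return one rack path per host, in the same order as the input."""
    return [RACK_BY_HOST.get(host, DEFAULT_RACK) for host in hosts]


if __name__ == "__main__":
    # Hadoop passes the hosts as command-line arguments.
    print(" ".join(resolve_racks(sys.argv[1:])))
```

Unknown hosts fall back to a default rack here, so a missing entry doesn't make the script fail.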
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-08-2016 07:02 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
Thanks @Junping Du
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
09-07-2016 03:51 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		5 Kudos
		
	
				
		
	
		
					
And what's the exact purpose of this? I understood that the default value (10) is very low and that we should set it to the number of nodes in the cluster. What would be the impact of setting it to, say, 1000 or even 100000? Thanks, experts!
						
					
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
Labels: Apache Hadoop, Apache YARN
 
			
    
	
		
		
08-17-2016 10:05 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
Tez, Pig, Sqoop, etc. are clients, so they won't be "green" like HDFS or YARN.
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
07-27-2016 04:07 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
@bigdata.neophyte Kerberizing the sandbox is not as tough as you may think. You can find some guidelines here, or an even simpler approach with FreeIPA: here or here.
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
07-18-2016 03:50 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
@Sunile Manjee HA is only related to the Oozie server, which is stateless (it relies on its backend database). Failover can therefore only be seen from a job submission perspective, so there's no impact on currently running jobs.
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
07-14-2016 07:46 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
You're apparently running out of file descriptors; try adjusting your ulimit accordingly.
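
If it helps to check what a process actually gets, here is a minimal sketch using Python's standard `resource` module (Unix only) to read the open-file limits. The `is_limit_sufficient` helper and the thresholds you pass to it are assumptions for illustration, not recommended values.

```python
import resource


def nofile_limits():
    """Return the (soft, hard) open-file limits of the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def is_limit_sufficient(soft, needed):
    """True if the soft limit covers the number of descriptors needed."""
    return soft == resource.RLIM_INFINITY or soft >= needed


# Print the limits that apply to this Python process.
soft, hard = nofile_limits()
print("soft:", soft, "hard:", hard)
```

Run it as the same user that runs the failing service, since limits are per user and per process. The soft limit is the one the process hits first, and it can be raised up to the hard limit.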
						
					
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
07-11-2016 08:08 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
@Abhishek Srivastava You can find hive-site.xml in /etc/hive/conf/, but if you're using Ambari to manage your cluster, make the modification in Ambari so it can deploy it to every host.
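
To check which value a host currently has, a small sketch like the one below can read the Hadoop-style `<property>` entries. It assumes the usual `<configuration>/<property>/<name>,<value>` layout; the `read_hadoop_properties` helper and the sample content are illustrative only.

```python
import xml.etree.ElementTree as ET


def read_hadoop_properties(xml_text):
    """Parse Hadoop-style configuration XML into a {name: value} dict."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        if name is not None:
            # A missing or empty <value> is stored as an empty string.
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


# Illustrative sample; a real hive-site.xml holds many more properties.
SAMPLE = """<configuration>
  <property>
    <name>hive.execution.engine</name>
    <value>tez</value>
  </property>
</configuration>"""

print(read_hadoop_properties(SAMPLE))
```

On a real host you would pass in the content of /etc/hive/conf/hive-site.xml instead of the sample string. This only reads the file; on an Ambari-managed cluster, changes should still go through Ambari.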
						
					