Member since 
    
	
		
		
		05-24-2020
	
	
	
	
	
	
	
	
	
	
	
	
	
	
			
      
                2
            
            
                Posts
            
        
                0
            
            
                Kudos Received
            
        
                0
            
            
                Solutions
            
        
			
    
	
		
		
		03-10-2021
	
		
		01:59 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 @GangWar   I don't think this is a problem with host resolution.  "glbgnameservice" is the value of our hdfs namespace,on this host I can access HDFS through shell command.I did not make any changes. After restarting the nodemanager process, the log of the nodemanager process is normal.The node with the problem becomes normal after restarting the nodemanager process. After a few days, other nodes that have not restarted may have the same problem. Finally, I restarted the entire yarn, and now it is under observation. 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		03-07-2021
	
		
		09:57 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 I have a cluster that has been running for two years. Recently, some nodemanager nodes will be abnormal, and it will be normal after restarting the nodemanager process. 
   
 2021-03-05 05:49:07,751 WARN org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: { hdfs://glbgnameservice/user/test/.sparkStaging/application_1614380939655_10466/hadoop-mapreduce-client-app-2.6.0-cdh5.8.0.jar, 1614923248393, FILE, null } failed: java.net.UnknownHostException: glbgnameservice  java.lang.IllegalArgumentException: java.net.UnknownHostException: glbgnameservice  at org.apache.hadoop.security.SecurityUtil.buildTokenService(SecurityUtil.java:406)  at org.apache.hadoop.hdfs.NameNodeProxies.createNonHAProxy(NameNodeProxies.java:310)  at org.apache.hadoop.hdfs.NameNodeProxies.createProxy(NameNodeProxies.java:176)  at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:708)  at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:651)  at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)  at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2696)  at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:94)  at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2733)  at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2715)  at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:382)  at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:249)  at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:61)  at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:359)  at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:357)  at java.security.AccessController.doPrivileged(Native Method)  at javax.security.auth.Subject.doAs(Subject.java:422)  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1693)  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:356)  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:60)  at java.util.concurrent.FutureTask.run(FutureTask.java:266)  at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)  at java.util.concurrent.FutureTask.run(FutureTask.java:266)  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)  at java.lang.Thread.run(Thread.java:748)  Caused by: java.net.UnknownHostException: glbgnameservice  ... 27 more  Caused by: glbgnameservice  java.net.UnknownHostException: glbgnameservice  at org.apache.hadoop.security.SecurityUtil.buildTokenService(SecurityUtil.java:406)  at org.apache.hadoop.hdfs.NameNodeProxies.createNonHAProxy(NameNodeProxies.java:310)  at org.apache.hadoop.hdfs.NameNodeProxies.createProxy(NameNodeProxies.java:176)  at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:708)  at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:651)  at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)  at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2696)  at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:94)  at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2733)  at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2715)  at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:382)  at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)  at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:249)  at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:61)  at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:359)  at org.apache.hadoop.yarn.util.FSDownload$2.run(FSDownload.java:357)  at java.security.AccessController.doPrivileged(Native Method)  at javax.security.auth.Subject.doAs(Subject.java:422)  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1693)  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:356)  at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:60)  at java.util.concurrent.FutureTask.run(FutureTask.java:266)  at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)  at java.util.concurrent.FutureTask.run(FutureTask.java:266)  at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)  at java.lang.Thread.run(Thread.java:748) 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
			
	
					
			
		
	
	
	
	
				
		
	
	
- Labels:
- 
						
							
		
			Apache Hadoop
- 
						
							
		
			Apache YARN
 
        
