Member since 
    
	
		
		
		10-28-2014
	
	
	
	
	
	
	
	
	
	
	
	
	
	
			
      
                82
            
            
                Posts
            
        
                18
            
            
                Kudos Received
            
        
                17
            
            
                Solutions
            
        My Accepted Solutions
| Title | Views | Posted | 
|---|---|---|
| 2366 | 07-11-2019 08:49 AM | |
| 5426 | 08-02-2018 12:20 PM | |
| 8267 | 06-19-2018 02:19 PM | |
| 3866 | 06-19-2018 08:32 AM | |
| 3584 | 12-21-2017 08:19 AM | 
			
    
	
		
		
		07-11-2019
	
		
		12:14 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 You are correct in that Director will attempt to connect via each IP in the list, preferring to connect over the private IP. You should be fine if only one of the IPs is accessible. Director should cycle through the IPs until the configured timeout, default 20 minutes after instance allocation.     Is the cluster failing to bootstrap? There are some natural places where we would expect connectivity to fail in a transient manner. E.g., when the VM is first allocated but doesn't yet respond to ssh, when rebooting the VM. If you are seeing failures that are preventing the cluster from bootstrapping (or even causing individual instances to fail) then we would be interested in seeing the log files to investigate. 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		07-11-2019
	
		
		08:49 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 dturner,     We don't have any documented recommendations for the max deployments or clusters. Anecdotally, I've heard of 100+ clusters. Director does the most work when bootstrapping/updating/terminating clusters and less when just monitoring clusters.      You are also running on a way larger instance than we usually use so I bet there's room for some tuning if you do run into issues (e.g., increasing memory, threadpools, etc). 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		09-20-2018
	
		
		12:07 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 I think that this log line highlights the error.     PluginExceptionCondition{type=ERROR, exceptionInfo={message=Encountered AWS exception, awsErrorCode=InvalidParameterValue, awsErrorMessage=Duplicate tag key 'Name' specified.}}     Are you trying to add a "Name" tag to your instances? Cloudera Altus Director uses the "Name" tag, but can be re-configured to use a different tag instead. See https://www.cloudera.com/documentation/director/latest/topics/director_aws_custom_tag_names.html for instructions on how to do so.    
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		08-02-2018
	
		
		12:20 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
							 iasindev,     The symptoms you describe are due to a bug in the 2.8.0 standalone client. The standalone client commands are deprecated in 2.8.0 and unfortunately suffer from this bug.     In preference order, workarounds are:  1. Use the bootstrap-remote command with a Cloudera Director server. This is the preferred configuration.  or  2. Unset the segment analytics writeKey property for the cli by adding segment.analytics.writeKey: to /etc/cloudera-director-client/application.properties  or  3. Use the 2.7.1 standalone client. It does not suffer from this bug.     Good luck!  David 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		06-19-2018
	
		
		02:19 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
							 sg321,     I've followed the instructions you linked through cluster bootstrap and successfully created a cluster using ami-2051294a in us-east-1.     I did have to make 1 modification to get the spark2 csd to match the spark2 parcel repo. You can do this by either bumping the csd up to http://archive.cloudera.com/spark2/csd/SPARK2_ON_YARN-2.2.0.cloudera2.jar or pinning the parcel repo to http://archive.cloudera.com/spark2/parcels/2.2.0.cloudera2/     Is this reliably/repeatably failing first run for you? 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		06-19-2018
	
		
		08:32 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
							 Tomas79,     Please refer to the CM and CDH documentation. Use the "API Name"  https://www.cloudera.com/documentation/enterprise/properties/5-15-x/topics/cm_props_cdh5150.html     Note that "Service-wide" configurations are specified at the cluster level in the serviceConfigs block while the role-specific configurations are specified per instance group in the roleTypeConfigs blocks.     David 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		06-18-2018
	
		
		08:57 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 sg321,     Please look at this article.  http://community.cloudera.com/t5/CDH-Manual-Installation/How-to-resolve-quot-Permission-denied-quot-errors-in-CDH/ta-p/36141     I find it strange that you are getting this error on a clean cluster bootstrap. Can you share your conf file? Please redact any secrets (e.g., aws keys). What version of Director are you using?     David 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		06-12-2018
	
		
		08:07 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
							 sg321,     You do need to look in CM to find the root cause of the First Run failure.  The CM UI has a tab "All Recent Commands".  https://www.cloudera.com/documentation/enterprise/5-12-x/topics/cm_dg_view_running_recent_commands.html     First Run is composed of a series of steps. You can determine which step failed through the UI.     You can also find this information in the scm-server-log. Information about the step that failed should appear somewhere above the scm-server-log message that you posted.     David 
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		12-21-2017
	
		
		08:19 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 dturner,     The clusters will be independent.     There is one minor caveat. Director finds the instances in AWS based on their "Cloudera-Director-Id" tag. Cloudera Director uses UUIDs for these so they will be unique across all instances. However, Cloudera Director accepts any String for this ID if you are creating your cluster through the API (i.e., not through Cloudera Director's UI or CLI). In that case, it is incumbent upon you to ensure you don't have any ID conflicts.     Here's an example of how you could get into trouble.  If you GET a ClusterTemplate through the API, change the "name", and POST it back to create a new Cluster. Director may find the wrong instances in AWS. To avoid this, you must also change all the "id" fields for all of the "virtualInstances".    
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		12-19-2017
	
		
		10:09 AM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Andy,     The best place to look is the agent install logs and the agent logs.  /tmp/scm_prepare_node.<Unique ID>  /var/log/cloudera-scm-agent     You should also check that your security group allows full access from other cluster instances (e.g., from other instances in the same security group).     It also looks like you are using custom DNS, but I still see the .ec2.internal addresses in the HostEndpoint list. If you've set up your DHCP Option Set to point to your own DNS server then you should disable DNS Hostnames and/or DNS Resolution on your VPC.     David 
						
					
					... View more