Member since: 10-28-2014
Posts: 82
Kudos Received: 18
Solutions: 17
My Accepted Solutions
Title | Views | Posted |
---|---|---|
 | 1700 | 07-11-2019 08:49 AM |
 | 4679 | 08-02-2018 12:20 PM |
 | 6891 | 06-19-2018 02:19 PM |
 | 3101 | 06-19-2018 08:32 AM |
 | 2912 | 12-21-2017 08:19 AM |
07-11-2019
12:14 PM
You are correct that Director will attempt to connect via each IP in the list, preferring the private IP. You should be fine if only one of the IPs is accessible. Director should cycle through the IPs until the configured timeout, which defaults to 20 minutes after instance allocation. Is the cluster failing to bootstrap? There are some natural places where we would expect connectivity to fail transiently, e.g., when the VM is first allocated but doesn't yet respond to ssh, or when the VM is rebooting. If you are seeing failures that prevent the cluster from bootstrapping (or even cause individual instances to fail), then we would be interested in seeing the log files to investigate.
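To make the retry behavior concrete, here is a minimal sketch of "cycle through the IPs in preference order until a deadline". It is not Director's actual code: the function name, the per-attempt timeout, and the pause between rounds are assumptions made for illustration. Only the 20-minute default comes from the description above.

```python
import socket
import time


def first_reachable(addresses, port=22, deadline_s=20 * 60, per_try_s=5.0,
                    pause_s=10.0, connect=socket.create_connection,
                    clock=time.monotonic, sleep=time.sleep):
    """Return the first address that accepts a TCP connection, or None.

    `addresses` is tried in the order given (e.g. private IP first).
    All names and defaults except the 20-minute deadline are hypothetical.
    """
    deadline = clock() + deadline_s
    while True:
        for addr in addresses:
            try:
                conn = connect((addr, port), timeout=per_try_s)
            except OSError:
                continue  # unreachable for now; try the next address
            conn.close()
            return addr
        if clock() >= deadline:
            return None  # nothing answered before the deadline
        sleep(pause_s)  # wait before starting the next round
```

Calling `first_reachable(["10.0.0.5", "203.0.113.7"])` would try the private address first on every round and fall back to the public one, which is why a single reachable IP is enough.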
07-11-2019
08:49 AM
dturner, We don't have any documented recommendations for the maximum number of deployments or clusters. Anecdotally, I've heard of 100+ clusters. Director does the most work when bootstrapping/updating/terminating clusters and less when just monitoring clusters. You are also running on a much larger instance than we usually use, so I bet there's room for some tuning if you do run into issues (e.g., increasing memory, threadpools, etc.).
09-20-2018
12:07 PM
I think that this log line highlights the error:
PluginExceptionCondition{type=ERROR, exceptionInfo={message=Encountered AWS exception, awsErrorCode=InvalidParameterValue, awsErrorMessage=Duplicate tag key 'Name' specified.}}
Are you trying to add a "Name" tag to your instances? Cloudera Altus Director uses the "Name" tag, but can be re-configured to use a different tag instead. See https://www.cloudera.com/documentation/director/latest/topics/director_aws_custom_tag_names.html for instructions on how to do so.
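If you want to catch this before bootstrapping, a small pre-flight check along these lines may help. This is a sketch, not part of Director: the function name and the reserved-key set are assumptions. The set lists only "Name" and "Cloudera-Director-Id" because those are the tags mentioned in these posts; adjust it if you have re-configured the tag names.

```python
# Tag keys assumed to be set by Director itself (hypothetical, incomplete list).
RESERVED_TAG_KEYS = frozenset({"Name", "Cloudera-Director-Id"})


def conflicting_tag_keys(user_tags, reserved=RESERVED_TAG_KEYS):
    """Return the user-supplied tag keys that collide with reserved keys.

    Comparison is exact (case-sensitive), matching how AWS treats tag keys.
    """
    return sorted(key for key in user_tags if key in reserved)


if __name__ == "__main__":
    tags = {"Name": "my-worker", "owner": "data-eng"}
    print(conflicting_tag_keys(tags))
```

An empty result means none of your custom tags collide with the keys in the set.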
08-02-2018
12:20 PM
1 Kudo
iasindev, The symptoms you describe are due to a bug in the 2.8.0 standalone client. The standalone client commands are deprecated in 2.8.0 and unfortunately suffer from this bug. In preference order, the workarounds are:
1. Use the bootstrap-remote command with a Cloudera Director server. This is the preferred configuration.
2. Unset the segment analytics writeKey property for the CLI by adding segment.analytics.writeKey: to /etc/cloudera-director-client/application.properties
3. Use the 2.7.1 standalone client. It does not suffer from this bug.
Good luck! David
06-19-2018
02:19 PM
1 Kudo
sg321, I've followed the instructions you linked through cluster bootstrap and successfully created a cluster using ami-2051294a in us-east-1. I did have to make one modification so that the spark2 CSD matches the spark2 parcel repo. You can do this either by bumping the CSD up to http://archive.cloudera.com/spark2/csd/SPARK2_ON_YARN-2.2.0.cloudera2.jar or by pinning the parcel repo to http://archive.cloudera.com/spark2/parcels/2.2.0.cloudera2/ Is First Run failing reliably/repeatably for you?
06-19-2018
08:32 AM
1 Kudo
Tomas79, Please refer to the CM and CDH documentation, and use the "API Name" listed there for each property as the configuration key: https://www.cloudera.com/documentation/enterprise/properties/5-15-x/topics/cm_props_cdh5150.html Note that "Service-wide" configurations are specified at the cluster level in the serviceConfigs block, while role-specific configurations are specified per instance group in the roleTypeConfigs blocks. David
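As a rough picture of where each kind of setting lives, here is a sketch that models the layout as a plain Python dict and lists every setting with its scope. Only the serviceConfigs and roleTypeConfigs block names come from the note above. The "virtualInstanceGroups" key, the group name, and the exact nesting are assumptions, and the property names are examples to check against the "API Name" column of the linked page.

```python
# Hypothetical layout: service-wide settings at the cluster level,
# role-specific settings inside each instance group.
cluster = {
    "serviceConfigs": {
        "HDFS": {"dfs_replication": "2"},
    },
    "virtualInstanceGroups": {
        "workers": {
            "roleTypeConfigs": {
                "DATANODE": {"dfs_data_dir_list": "/data/dfs/dn"},
            },
        },
    },
}


def list_configs(cluster):
    """Return (scope, owner, api_name, value) for every setting found."""
    found = []
    for service, props in cluster.get("serviceConfigs", {}).items():
        for api_name, value in props.items():
            found.append(("service-wide", service, api_name, value))
    for group, body in cluster.get("virtualInstanceGroups", {}).items():
        for role, props in body.get("roleTypeConfigs", {}).items():
            for api_name, value in props.items():
                found.append(("role:" + group, role, api_name, value))
    return found


if __name__ == "__main__":
    for row in list_configs(cluster):
        print(row)
```

The point of the split is that a service-wide value is set once for the cluster, while a role-specific value can differ from one instance group to the next.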
06-18-2018
08:57 AM
sg321, Please look at this article: http://community.cloudera.com/t5/CDH-Manual-Installation/How-to-resolve-quot-Permission-denied-quot-errors-in-CDH/ta-p/36141 I find it strange that you are getting this error on a clean cluster bootstrap. Can you share your conf file? Please redact any secrets (e.g., AWS keys). What version of Director are you using? David
06-12-2018
08:07 AM
1 Kudo
sg321, You do need to look in CM to find the root cause of the First Run failure. The CM UI has a tab "All Recent Commands". https://www.cloudera.com/documentation/enterprise/5-12-x/topics/cm_dg_view_running_recent_commands.html First Run is composed of a series of steps. You can determine which step failed through the UI. You can also find this information in the scm-server-log. Information about the step that failed should appear somewhere above the scm-server-log message that you posted. David
12-21-2017
08:19 AM
dturner, The clusters will be independent. There is one minor caveat. Director finds the instances in AWS based on their "Cloudera-Director-Id" tag. Cloudera Director uses UUIDs for these, so they will be unique across all instances. However, Cloudera Director accepts any String for this ID if you are creating your cluster through the API (i.e., not through Cloudera Director's UI or CLI). In that case, it is incumbent upon you to ensure you don't have any ID conflicts. Here's an example of how you could get into trouble: if you GET a ClusterTemplate through the API, change the "name", and POST it back to create a new Cluster, Director may find the wrong instances in AWS. To avoid this, you must also change all the "id" fields for all of the "virtualInstances".
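One way to do the renaming safely is sketched below. It works on the template as parsed JSON (a Python dict) and makes no assumption about where "virtualInstances" lists sit in the document, so it walks the whole structure. The function name and the sample template are hypothetical; the GET and POST calls themselves are left out.

```python
import copy
import uuid


def clone_template(template, new_name):
    """Copy a cluster template, rename it, and give every entry in every
    "virtualInstances" list a fresh UUID for its "id" field."""
    clone = copy.deepcopy(template)  # leave the caller's template untouched
    clone["name"] = new_name

    def refresh_ids(node):
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "virtualInstances" and isinstance(value, list):
                    for instance in value:
                        if isinstance(instance, dict) and "id" in instance:
                            instance["id"] = str(uuid.uuid4())
                refresh_ids(value)
        elif isinstance(node, list):
            for item in node:
                refresh_ids(item)

    refresh_ids(clone)
    return clone


if __name__ == "__main__":
    # Hypothetical, heavily trimmed template shape for illustration only.
    original = {
        "name": "cluster-a",
        "groups": {"workers": {"virtualInstances": [{"id": "abc"}, {"id": "def"}]}},
    }
    print(clone_template(original, "cluster-b"))
```

POSTing the returned dict instead of the hand-edited original keeps the new cluster's IDs from colliding with those of the cluster it was copied from.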
12-19-2017
10:09 AM
Andy, The best places to look are the agent install logs and the agent logs:
/tmp/scm_prepare_node.<Unique ID>
/var/log/cloudera-scm-agent
You should also check that your security group allows full access from other cluster instances (e.g., from other instances in the same security group). It also looks like you are using custom DNS, but I still see the .ec2.internal addresses in the HostEndpoint list. If you've set up your DHCP Option Set to point to your own DNS server, then you should disable DNS Hostnames and/or DNS Resolution on your VPC. David