Member since
03-19-2019
67
Posts
38
Kudos Received
19
Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 3773 | 08-03-2018 03:48 PM
 | 945 | 05-04-2018 03:26 PM
 | 1104 | 01-12-2018 07:16 PM
 | 1474 | 09-13-2017 02:29 PM
 | 3609 | 05-10-2017 07:59 PM
03-27-2017
09:05 AM
Hi, Unfortunately in this case you can only update the DB directly, like this:

docker exec -it cbreak_cbdb_1 psql -U postgres
select id,attributes from credential where cloudplatform='AZURE_RM';
update credential set attributes='XX' where id=X;

1. First, exec into the DB docker container.
2. Then select the credential you want to update (note its ID, and copy the whole attributes field you want to change).
3. Finally, update the field with the desired changes (please make sure the formatting is correct and the same as before).
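Since a malformed attributes value will break the credential, it's worth validating the edited JSON before running the UPDATE. A minimal sketch (the field names below are placeholders, not a real Azure credential):

```python
import json

# Placeholder for the edited copy of the attributes field; the keys are
# hypothetical, not a real credential.
edited_attributes = '{"subscriptionId": "placeholder", "tenantId": "placeholder"}'

try:
    parsed = json.loads(edited_attributes)
    print("attributes are valid JSON, safe to run the UPDATE")
except ValueError as err:
    print("fix the attributes before updating:", err)
```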
03-23-2017
10:08 PM
I think this is an Ambari bug. You can report the issue here after you've registered: http://issues.apache.org/jira/browse/AMBARI . Also, could you please accept the answer here so others can find the solution while it's being fixed? Thank you. Br, Krisz
03-23-2017
03:02 PM
Hi, Ports 22 and 9443 must be open, otherwise the provisioning will fail. Port 9443 is used with two-way SSL by Cloudbreak to orchestrate the install.
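A quick pre-flight check can save a failed provision. Here is a minimal sketch using Python's standard library (the gateway address is a placeholder, not a real host):

```python
import socket

def port_open(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Hypothetical gateway address; 22 and 9443 must both report open,
# otherwise the provisioning will fail.
# for p in (22, 9443):
#     print(p, "open" if port_open("gateway.example.com", p) else "closed")
```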
03-23-2017
02:55 PM
Hi, I can reproduce your issue on HDP 2.5 as well. It seems the Spark assembly contains invalid Azure storage jars. To fix this quickly, I did the following:

mkdir -p /tmp/jarupdate && cd /tmp/jarupdate
find /usr/hdp/ -name "azure-storage*.jar"
cp /usr/hdp/2.5.0.1-210/hadoop/lib/azure-storage-2.2.0.jar .
cp /usr/hdp/current/spark-historyserver/lib/spark-assembly-1.6.3.2.5.0.1-210-hadoop2.7.3.2.5.0.1-210.jar .
unzip azure-storage-2.2.0.jar
jar uf spark-assembly-1.6.3.2.5.0.1-210-hadoop2.7.3.2.5.0.1-210.jar com/
mv -f spark-assembly-1.6.3.2.5.0.1-210-hadoop2.7.3.2.5.0.1-210.jar /usr/hdp/current/spark-historyserver/lib/spark-assembly-1.6.3.2.5.0.1-210-hadoop2.7.3.2.5.0.1-210.jar
cd .. && rm -rf /tmp/jarupdate
Basically, I put the desired class files into the assembly jar, replacing the original jar file. Once that's done, just start the history server and it should be fine. I changed two configurations in spark-defaults:

spark.eventLog.dir = wasb://cloudbreak492@kriszwasbnorth.blob.core.windows.net/spark-history
spark.history.fs.logDirectory = wasb://cloudbreak492@kriszwasbnorth.blob.core.windows.net/spark-history

Once the history server started, I was able to run the Spark TeraGen job:

spark-submit --class com.nexr.spark.terasort.TeraGen --deploy-mode cluster --master yarn-cluster --num-executors 1 spark-terasort-0.1.jar 1G wasb://cloudbreak492@kriszwasbnorth.blob.core.windows.net/teradata
03-22-2017
06:14 AM
1 Kudo
Hi, You can download the CLI like this:

curl -kL https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com/hdc-cli_$(uname)_x86_64.tgz | tar -xz

Here is an article about this: HDCloud using the HDC cli
03-21-2017
09:53 AM
Hi, Are you using wasb as the default filesystem? If so, check hdfs-site for fs.defaultFS. I'd expect it to look something like this: wasb://<container>@<storage_account>.blob.core.windows.net/spark-history
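For reference, the wasb URIs used elsewhere in this thread follow the form wasb://<container>@<storage_account>.blob.core.windows.net/<path>. A quick sketch showing how the pieces break down (the storage account is an example value from this thread, not a live account):

```python
from urllib.parse import urlparse

# Example wasb URI of the kind you'd find in fs.defaultFS:
uri = "wasb://cloudbreak492@kriszwasbnorth.blob.core.windows.net/spark-history"
parts = urlparse(uri)

print(parts.scheme)    # filesystem scheme: wasb
print(parts.username)  # blob container: cloudbreak492
print(parts.hostname)  # storage account endpoint: kriszwasbnorth.blob.core.windows.net
print(parts.path)      # directory inside the container: /spark-history
```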
03-09-2017
04:55 PM
2 Kudos
Hi, It looks like a bug in Cloudbreak that occurs when you don't attach volumes to the instances. Volumes can be configured when you create a template on the UI. We'll get this fixed.
03-02-2017
11:54 AM
6 Kudos
Download the HDC cli from the cloud controller

Although it's possible to download the cli from the cloud controller UI, we're going to use curl, since we're about to automate the process. All we need is the address of the cloud controller (with $(uname) we make sure to download the appropriate binary for our OS; note: all of the instance references are examples, there is no real running AWS instance):

curl -kL https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com/hdc-cli_$(uname)_x86_64.tgz | tar -xz

Once it's downloaded, we can verify its version:

./hdc --version
hdc version 1.13.0-2017-02-09T08:56:06

Configure the cli to use our cloud controller

The cli can connect to multiple cloud controllers given their address and credentials. Each command has parameters for these (they can also be set with environment variables):

--server value     server address [$CB_SERVER_ADDRESS]
--username value   user name (e-mail address) [$CB_USER_NAME]
--password value   password [$CB_PASSWORD]

For convenience we could save them into a file so we wouldn't have to pass these parameters to every command:

./hdc configure --server https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com --username admin@hortonworks.com --password 'adminPassword123!'

but that's not a good idea from an automation perspective, because we want the logs to show each command exactly as it was run.

Create a cluster using a JSON skeleton

Let's generate our base JSON skeleton:

./hdc create-cluster generate-cli-skeleton
{
"ClusterName": "",
"HDPVersion": "2.5",
"ClusterType": "EDW-ETL: Apache Hive 1.2.1, Apache Spark 1.6",
"Master": {
"InstanceType": "m4.4xlarge",
"VolumeType": "gp2",
"VolumeSize": 32,
"VolumeCount": 1,
"InstanceCount": 1,
"Recipes": []
},
"Worker": {
"InstanceType": "m3.xlarge",
"VolumeType": "ephemeral",
"VolumeSize": 40,
"VolumeCount": 2,
"InstanceCount": 3,
"Recipes": [],
"RecoveryMode": "AUTO"
},
"Compute": {
"InstanceType": "m3.xlarge",
"VolumeType": "ephemeral",
"VolumeSize": 40,
"VolumeCount": 1,
"InstanceCount": 0,
"Recipes": [],
"RecoveryMode": "AUTO",
"SpotPrice": "0"
},
"SSHKeyName": "",
"RemoteAccess": "",
"WebAccess": true,
"HiveJDBCAccess": true,
"ClusterComponentAccess": false,
"ClusterAndAmbariUser": "",
"ClusterAndAmbariPassword": "",
"InstanceRole": "CREATE",
"Network": {
"VpcId": "",
"SubnetId": ""
},
"Tags": {},
"HiveMetastore": {
"Name": "",
"Username": "",
"Password": "",
"URL": "",
"DatabaseType": ""
},
"Configurations": []
}
As you can see, there are default values for certain properties, but a few that we need to provide are missing. To manipulate the JSON we're going to use a really handy tool called jq. In this tutorial we're not going to change instance types, volumes, etc., although we could; for demonstration purposes let's just set the missing properties and write the result to a file called cluster.json:

./hdc create-cluster generate-cli-skeleton | jq '.ClusterName = "tutorial-cluster" | .Worker.InstanceCount = 1 | .Compute.SpotPrice = "0.5" | .Compute.InstanceCount = 1 | .Compute.RecoveryMode = "MANUAL" | .SSHKeyName = "my-aws-key" | .RemoteAccess = "0.0.0.0/0" | .ClusterComponentAccess = true | .ClusterAndAmbariUser = "admin" | .ClusterAndAmbariPassword = "admin"' > cluster.json

The result should look something like this:

cat cluster.json
{
"ClusterName": "tutorial-cluster",
"HDPVersion": "2.5",
"ClusterType": "Data Science: Apache Spark 1.6, Apache Zeppelin 0.6.0",
"Master": {
"InstanceType": "m4.4xlarge",
"VolumeType": "gp2",
"VolumeSize": 32,
"VolumeCount": 1,
"InstanceCount": 1,
"Recipes": []
},
"Worker": {
"InstanceType": "m3.xlarge",
"VolumeType": "ephemeral",
"VolumeSize": 40,
"VolumeCount": 2,
"InstanceCount": 1,
"Recipes": [],
"RecoveryMode": "AUTO"
},
"Compute": {
"InstanceType": "m3.xlarge",
"VolumeType": "ephemeral",
"VolumeSize": 40,
"VolumeCount": 1,
"InstanceCount": 1,
"Recipes": [],
"RecoveryMode": "MANUAL",
"SpotPrice": "0.5"
},
"SSHKeyName": "my-aws-key",
"RemoteAccess": "0.0.0.0/0",
"WebAccess": true,
"HiveJDBCAccess": true,
"ClusterComponentAccess": true,
"ClusterAndAmbariUser": "admin",
"ClusterAndAmbariPassword": "admin",
"InstanceRole": "CREATE",
"Network": {
"VpcId": "",
"SubnetId": ""
},
"Tags": {},
"HiveMetastore": {
"Name": "",
"Username": "",
"Password": "",
"URL": "",
"DatabaseType": ""
},
"Configurations": []
}
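If jq is not at hand, the same edits can be sketched in plain Python. The trimmed dictionary below copies only the keys this tutorial modifies from the generated skeleton:

```python
import json

# Trimmed copy of the generate-cli-skeleton output, keeping only the
# keys the tutorial modifies.
skeleton = {
    "ClusterName": "",
    "Worker": {"InstanceCount": 3, "RecoveryMode": "AUTO"},
    "Compute": {"InstanceCount": 0, "RecoveryMode": "AUTO", "SpotPrice": "0"},
    "SSHKeyName": "",
    "RemoteAccess": "",
    "ClusterComponentAccess": False,
    "ClusterAndAmbariUser": "",
    "ClusterAndAmbariPassword": "",
}

# The same assignments the jq filter performs:
skeleton["ClusterName"] = "tutorial-cluster"
skeleton["Worker"]["InstanceCount"] = 1
skeleton["Compute"].update(SpotPrice="0.5", InstanceCount=1, RecoveryMode="MANUAL")
skeleton["SSHKeyName"] = "my-aws-key"
skeleton["RemoteAccess"] = "0.0.0.0/0"
skeleton["ClusterComponentAccess"] = True
skeleton["ClusterAndAmbariUser"] = "admin"
skeleton["ClusterAndAmbariPassword"] = "admin"

# Write the edited skeleton out, just like the jq redirection did.
with open("cluster.json", "w") as f:
    json.dump(skeleton, f, indent=2)
```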
We used 0.0.0.0/0 for RemoteAccess, but this is highly discouraged for production clusters. Let's create this cluster and wait until it finishes. We're going to use the --wait flag so we don't have to write custom functions to poll the cluster state:

./hdc create-cluster --cli-input-json cluster.json --server https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com --username admin@hortonworks.com --password 'adminPassword123!' --wait true

Once the command returns, we can check the instances:

./hdc describe-cluster instances --cluster-name tutorial-cluster --server https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com --username admin@hortonworks.com --password 'adminPassword123!'
[
{
"InstanceId": "i-xxxx",
"Hostname": "ip-10-0-1-159.eu-west-1.compute.internal",
"PublicIP": "x.x.x.x",
"PrivateIP": "10.0.1.159",
"InstanceStatus": "REGISTERED",
"HostStatus": "HEALTHY",
"Type": "master - ambari server"
},
{
"InstanceId": "i-xxxx",
"Hostname": "ip-10-0-1-185.eu-west-1.compute.internal",
"PublicIP": "x.x.x.x",
"PrivateIP": "10.0.1.185",
"InstanceStatus": "REGISTERED",
"HostStatus": "HEALTHY",
"Type": "worker"
},
{
"InstanceId": "i-xxxx",
"Hostname": "ip-10-0-1-197.eu-west-1.compute.internal",
"PublicIP": "x.x.x.x",
"PrivateIP": "10.0.1.197",
"InstanceStatus": "REGISTERED",
"HostStatus": "HEALTHY",
"Type": "compute"
}
]
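Because the CLI emits plain JSON, the instance list is easy to post-process in scripts. A small sketch (over a trimmed copy of the output above) that flags any host that isn't HEALTHY:

```python
import json

# Trimmed copy of the describe-cluster instances output shown above.
instances_json = """
[
  {"Hostname": "ip-10-0-1-159.eu-west-1.compute.internal",
   "InstanceStatus": "REGISTERED", "HostStatus": "HEALTHY",
   "Type": "master - ambari server"},
  {"Hostname": "ip-10-0-1-185.eu-west-1.compute.internal",
   "InstanceStatus": "REGISTERED", "HostStatus": "HEALTHY",
   "Type": "worker"}
]
"""

instances = json.loads(instances_json)
unhealthy = [i["Hostname"] for i in instances if i["HostStatus"] != "HEALTHY"]
print("unhealthy hosts:", unhealthy or "none")
```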
Terminate the cluster

Now that we have a cluster, we could execute jobs, queries, etc., which this tutorial doesn't cover. When we no longer need the cluster, we can simply terminate it:

./hdc terminate-cluster --cluster-name tutorial-cluster --server https://ec2-34-251-140-175.eu-west-1.compute.amazonaws.com --username admin@hortonworks.com --password 'adminPassword123!' --wait true
02-15-2017
12:03 PM
I think the root path has changed; can you try cb/api/v1/stacks/3 instead of /api/v1/stacks/3?
12-08-2016
08:24 AM
You can see the supported versions here: http://sequenceiq.com/cloudbreak-docs/latest/openstack/ Unfortunately we haven't had a chance to test it with Newton, but most likely it works with that version too.