Member since
07-09-2015
70
Posts
29
Kudos Received
12
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
12020 | 11-23-2018 03:38 AM | |
2853 | 10-07-2018 11:44 PM | |
3554 | 09-24-2018 12:09 AM | |
5683 | 09-13-2018 02:27 AM | |
3848 | 09-12-2018 02:27 AM |
10-09-2017
11:21 AM
Hello, I can reproduce the "Engine exited with status 2." if I'm using the v1 engine but it works with the v2 engine. What version are you using? - If you are using 1.0.x then I recommend upgrading to 1.1.1. - If you are already using 1.1.1 then you should use the v2 engine. You can go to your Settings menu on the Project page and select the "Base Image v2, docker.repository.cloudera.com/cdsw/engine:2" on the Engines tab. You can also change the default engine in the Admin menu but it will be applied to new projects, for already existing ones you need to select it manually. Regards, Peter
... View more
07-24-2017
09:01 AM
1 Kudo
Hi, Did you add the dedicated CDSW host to the cluster in CM? From the documentation: "Cloudera Data Science Workbench hosts must be added to your CDH cluster as gateway hosts, with gateway roles properly configured." https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_install.html#configure_edge_hosts Regards, Peter
... View more
07-13-2017
05:19 AM
Hi Chris, You can use the SparkSession object to query your Hive tables. You don't need to create a HiveContext. spark.catalog.listTables.show()
spark.sql("describe database default").show
spark.sql("describe formatted default.mytable").show I don't think that this change would fix your issue. This doesn't seem like a problem with your CDSW installation. Could you execute the same commands from a Session terminal or directly on the CDSW host machine using the spark2-shell cli tool to verify this? Could you double check that a Hive gateway is deployed to your CDSW host? Can you show that the table exists from eg. beeline? Regards, Peter
... View more
07-13-2017
04:06 AM
Hello Chris, What are you trying to achieve? Why are you trying to run in local mode? Can you use the expected yarn client mode instead? From the documentation: "Unlike PySpark or Sparklyr, you can access a SparkContext assigned to the spark (SparkSession) and sc(SparkContext) objects on console startup, just as when using the Spark shell." https://www.cloudera.com/documentation/data-science-workbench/latest/topics/cdsw_spark_and_scala.html Can you access to your hive tables using the spark SparkSession object? Regards, Peter
... View more
07-06-2017
07:59 AM
Hello, This is weird. Could you try a manual docker pull? docker.repository.cloudera.com/cdsw/1.0.1/web:052787a Thanks, Peter
... View more
07-05-2017
01:32 AM
Hi NES, I'm glad that your issue is resolved! Could you let us know what was the issue for the other topic followers? Thanks, Peter
... View more
07-05-2017
12:45 AM
Hi NES, I see that the web pods are stuck in ContainerCreating state. I would try to submit a kubectl describe command on one of the web pods and check if we have any associated events. $ kubectl describe pod web-3826671331-5b7wk If there are no events I would check the kubelet logs, maybe it shows the reason why are we stuck: # get the kubelet logs in less $ journalctl -u kubelet # tail the kubelet logs $ journalctl -u kubelet -o cat -l -f Regards, Peter
... View more
07-04-2017
05:48 AM
Hi NES, Could you share the full 'cdsw status' output? I would like to see what system pods are not ready. There is a db-migrate-XXXXX-xxxxx Kubernetes job which should make sure that the tables are created. When you submit the 'cdsw status' command, can you see this job in the output? Did you see a Completed status for this? When you changed the cdsw configuration, did you submit a 'cdsw reset' before retrying the 'cdsw init'? If not, you will need to do a 'cdsw reset' and 'cdsw init' again. 'cdsw logs' generates a tar.gz file with different diagnostic information. If you want to validate your cluster, I recommend to use the 'cdsw validate' instead. Thanks, Peter
... View more
06-22-2017
02:25 AM
Hi Krishna, There is the true flag under the Stateful column, just it's not rendered nicely, please disregard the previous post. Could you try to restart your cdsw application and upload the `cdsw status` command output again? We shouldn't see multiple database pods there. Thanks, Peter
... View more
06-22-2017
02:20 AM
Hi Krishna, One issue we see is the master node is missing the stateful true flag. Something might went wrong with your installation. The code which would put the stateful tag on the node is using the `hostname` command output. Could you check if the `hostname` output matches the node name you see in the `cdsw status` output? I would start by doing `cdsw stop` and `cdsw start` on the master node and see if the stateful flag gets there. If this is not working you will probably need to do a `cdsw reset` and `cdsw init` for the master and workers also. Regards, Peter
... View more
- « Previous
- Next »