03-31-2017 04:51 AM
I use Cloudera VM (running through VirtualBox on a Mac) for an online analytics course. The exercise involves running Scala on a given dataset. The given spark shell command does not load all the jar files.
spark-shell --jars lib/gs-core-1.2.jar,lib/gs-ui-1.2.jar,lib/jcommon-1.0.16.jar,lib/jfreechart-1.0.13.jar,lib/breeze_2.10-0.9.jar,lib/breeze-viz_2.10-0.9.jar,lib/pherd-1.0.jar
Part of the console output displays the following: Warning: Local jar /home/cloudera/lib/jcommon-1.0.16.jar does not exist, skipping.
Further on in the exercise, I run the following command.
The console throws an error. Here's the partial output
17/03/25 09:36:58 ERROR executor.Executor: Exception in task 0.0 in stage 0.0 (TID 0) java.lang.RuntimeException: Stream '/jars/jcommon-' was not found.
Since this vm is a self-contained environment created by Cloudera, I have no idea how to 1) update this Java instance to include missing jar files, or 2) circumvent Cloudera's failure to include a proper install so that I can complete the exercise.
I appreciate your help in resolving this issue.