I use Cloudera VM (running through VirtualBox on a Mac) for an online analytics course. The exercise involves running Scala on a given dataset. The given spark shell command does not load all the jar files.
Part of the console output displays the following: Warning: Local jar /home/cloudera/lib/jcommon-1.0.16.jar does not exist, skipping.
Further on in the exercise, I run the following command.
The console throws an error. Here's the partial output
17/03/25 09:36:58 ERROR executor.Executor: Exception in task 0.0 in stage 0.0 (TID 0)
java.lang.RuntimeException: Stream '/jars/jcommon-' was not found.
Since this vm is a self-contained environment created by Cloudera, I have no idea how to 1) update this Java instance to include missing jar files, or 2) circumvent Cloudera's failure to include a proper install so that I can complete the exercise.