The CCP exams, including the CCP:Data Science exam, do not force you to use a particular tool or programming language. They describe a scenario and ask you to come up with a set of results based on the problem description.
The DS200 Solution Kit is the solution that our data scientists came up with during one of the early exams. You can choose any language to work in. They chose Python; this was before Spark was even supported on our clusters. The kit is there merely to be representative of the types of questions that are asked during the exam, not to show you how you must answer the new exam.
You may install any tools and libraries that you would like to use during the DS exam. Dozens of tools and libraries are already installed; they are detailed on the certification webpages.
Currently, the cluster is open to the internet and there are no restrictions on tools you can install or websites or resources you may use.
Perhaps this will answer your question.