Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Celebrating as our community reaches 100,000 members! Thank you!
avatar
Contributor

The following steps can be used to configure Intellij to run Spark jobs on Cloudera Data Engineering experience (CDE). This way the developers will be able to test their jobs on CDE without having to leave the IDE. The following steps are shown for an Intellij installed on a Windows PC.

  1. Complete setting up CDE CLI with Git Bash as per this article.
  2. In IntelliJ, go to File > Settings, modify the "shell path" to the following:
    C:\Program Files\Git\bin\bash.exe" --login -i
    3.png
  3. Test this setting by launching a terminal from within the IntelliJ project and running CDE CLI commands. If you are unable to run CDE CLI commands (with environment variable CDE_CONFIG not configured), you will have the option to setup/override it in the next step.
    3.png
  4. Add a Run/Debug configuration with a new Shell script with the following details. Ignore the error stating that the Shell script is not found. You have the option of overriding the CDE_CONFIG environment variable (in case you want to submit it to a different cluster than the system default). Use environment variables to supply arguments to your Spark job:
    4.png
  5. Run the cde spark submit from the Run/Debug menu as shown as follows. You should see the job run in the terminal window in Intellij.
    5.png
1,494 Views