PySpark in Zeppelin: Does not have all libraries
Labels: Apache Zeppelin
Created ‎12-09-2015 02:46 PM
I am able to import a library in the pyspark shell without any problems, but when I try to import the same library in Zeppelin, I get an error:
ImportError: No module named xxxxx
Created ‎12-09-2015 02:52 PM
Do you have multiple Python versions installed?
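One way to check, as a minimal sketch: run the snippet below once in the pyspark shell and once in a Zeppelin paragraph (under the `%pyspark` interpreter) and compare what it prints. If the executable or version differs, the two environments are running different Python installations, each with its own set of installed packages. The helper name `describe_interpreter` is only illustrative.

```python
import sys


def describe_interpreter():
    """Return details that identify the Python interpreter running this code."""
    return {
        "executable": sys.executable,          # path of the running interpreter
        "version": sys.version.split()[0],     # e.g. "2.7.x" or "3.x.y"
        "search_path": list(sys.path),         # directories searched on import
    }


info = describe_interpreter()
print(info["executable"])
print(info["version"])
for entry in info["search_path"]:
    print(entry)
```

The module that fails to import in Zeppelin should live under one of the directories printed by the pyspark shell; if none of those directories show up in the Zeppelin output, that would explain the ImportError.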
Created ‎12-09-2015 03:05 PM
Please check this http://stackoverflow.com/questions/13212987/cannot...
Created ‎12-09-2015 03:20 PM
I had 2 versions of Python installed. Zeppelin is still using the older one.
Created ‎12-09-2015 04:58 PM
For anyone else who may encounter this issue and end up here: this is most commonly the result of having multiple Python versions installed. However, if you are using Zeppelin (which is the case here), it is pretty easy to point it to a different version of Python. In the Zeppelin UI, go to Interpreter > Spark, change the 'zeppelin.pyspark.python' property from 'python' to '/path/to/correct/pythondir/python', and click Save.
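Before changing the property, it may help to confirm that the interpreter you plan to point Zeppelin at can actually import the missing library. The following is a rough sketch in Python 3, not anything Zeppelin provides: `can_import` is a hypothetical helper, and `candidate` and the module name `json` are placeholders to replace with your own interpreter path and library.

```python
import subprocess
import sys


def can_import(python_path, module_name):
    """Return True if the interpreter at python_path can import module_name."""
    try:
        # Run the import in a separate process so the result reflects the
        # interpreter at python_path, not the one executing this script.
        returncode = subprocess.call(
            [python_path, "-c", "import " + module_name],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except OSError:
        # The path does not exist or is not executable.
        return False
    return returncode == 0


# Placeholder: replace with the path you intend to set in zeppelin.pyspark.python.
candidate = sys.executable
print(can_import(candidate, "json"))
```

If this reports False for the library in question, the package still needs to be installed for that interpreter, and changing the Zeppelin property alone would not fix the import.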
Created ‎12-09-2015 07:55 PM
Should it be the Python directory or the PySpark directory?
As in /usr/local/../python or /usr/hdp/2..../spark/python?
Created ‎12-10-2015 05:13 AM
@Vedant Jain: I believe it should be the full path to the Python executable you wish to use (assuming you don't want to use the default), not a directory.
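If you are not sure what the full path is, something like the sketch below (Python 3) can list where common interpreter names resolve on the PATH. The function name `resolve_executable` and the list of names tried are my own choices for illustration; your system may use different names.

```python
import os
import shutil


def resolve_executable(name_or_path):
    """Return the absolute, symlink-resolved path of an executable, or None."""
    found = shutil.which(name_or_path)  # searches PATH, like the shell would
    if found is None:
        return None
    return os.path.realpath(found)


# Print where each common interpreter name resolves, if it exists at all.
for name in ("python", "python2", "python3"):
    print(name, "->", resolve_executable(name))
```

Whichever printed path belongs to the installation that has your library is the value to try for 'zeppelin.pyspark.python'.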
