<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to import Pandas and Numpy in the Livy2. in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/288773#M213850</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/51326"&gt;@Sambavi&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;You can install any required dependencies on all nodes and use them, but keep in mind that Pandas and NumPy don't provide distributed computing, so they won't work with big data sets.&lt;/P&gt;&lt;P&gt;If your Zeppelin is configured to use YARN cluster mode, all the data will be pulled to the Spark driver, on whichever data node the driver is running, and processed there. (If the data set is not big, you can increase the driver resources and it will work, but that is not really a solution.)&lt;/P&gt;&lt;P&gt;If you use client mode, everything will be pulled onto the Zeppelin node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I recommend trying HandySpark:&amp;nbsp;&lt;A href="https://github.com/dvgodoy/handyspark" target="_blank" rel="noopener"&gt;https://github.com/dvgodoy/handyspark&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 31 Jan 2020 07:16:33 GMT</pubDate>
    <dc:creator>goga</dc:creator>
    <dc:date>2020-01-31T07:16:33Z</dc:date>
    <item>
      <title>How to import Pandas and Numpy in the Livy2.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/288735#M213841</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Can someone help me import Pandas and NumPy in Livy2?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm using&amp;nbsp;Zeppelin with the Livy2 interpreter. I created a notebook under Livy and entered the following:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;%pyspark&lt;/P&gt;
&lt;P&gt;import pandas as py&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I get the following error:&lt;/P&gt;
&lt;P&gt;"No module found"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I run the same code with the pyspark interpreter, it works and I don't have any issues. I need some help with this.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks&lt;/P&gt;
&lt;P&gt;Sambasivam.&lt;/P&gt;</description>
      <pubDate>Fri, 31 Jan 2020 02:57:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/288735#M213841</guid>
      <dc:creator>Sambavi</dc:creator>
      <dc:date>2020-01-31T02:57:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to import Pandas and Numpy in the Livy2.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/288773#M213850</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/51326"&gt;@Sambavi&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;You can install any required dependencies on all nodes and use them, but keep in mind that Pandas and NumPy don't provide distributed computing, so they won't work with big data sets.&lt;/P&gt;&lt;P&gt;If your Zeppelin is configured to use YARN cluster mode, all the data will be pulled to the Spark driver, on whichever data node the driver is running, and processed there. (If the data set is not big, you can increase the driver resources and it will work, but that is not really a solution.)&lt;/P&gt;&lt;P&gt;If you use client mode, everything will be pulled onto the Zeppelin node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I recommend trying HandySpark:&amp;nbsp;&lt;A href="https://github.com/dvgodoy/handyspark" target="_blank" rel="noopener"&gt;https://github.com/dvgodoy/handyspark&lt;/A&gt;&lt;/P&gt;</description>
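      <content:encoded><![CDATA[
The error in the question may simply mean that the Python interpreter behind the Livy session is not the one where pandas is installed, since the same import works in the pyspark interpreter. Below is a minimal sketch for checking this from a %pyspark paragraph, using only the standard library. check_modules is a hypothetical helper name, not part of Livy or Zeppelin.

```python
import importlib.util
import sys


def check_modules(names):
    """Return a dict mapping each top-level module name to True if it can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


# Which Python interpreter is this session actually running?
print("Python executable:", sys.executable)

# Report whether the libraries from the question are visible to that interpreter.
for name, found in check_modules(["pandas", "numpy"]).items():
    print(name, "->", "available" if found else "NOT installed for this interpreter")
```

If the printed executable differs from the one the pyspark interpreter reports, installing the packages for that interpreter on all nodes is the likely fix.
]]></content:encoded>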
      <pubDate>Fri, 31 Jan 2020 07:16:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/288773#M213850</guid>
      <dc:creator>goga</dc:creator>
      <dc:date>2020-01-31T07:16:33Z</dc:date>
    </item>
    <item>
      <title>Re: How to import Pandas and Numpy in the Livy2.</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/289882#M214514</link>
      <description>&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I did try this and it worked out fine.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 17 Feb 2020 01:47:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-to-import-Pandas-and-Numpy-in-the-Livy2/m-p/289882#M214514</guid>
      <dc:creator>Sambavi</dc:creator>
      <dc:date>2020-02-17T01:47:45Z</dc:date>
    </item>
  </channel>
</rss>

