<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: broken pipe error while running a spark job from namenode in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197246#M62346</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10230/hardikvdesai.html" nodeid="10230"&gt;@hardik desai
&lt;/A&gt;&lt;/P&gt;&lt;P&gt;As the issue is intermittent so this may not be script related issue.&lt;/P&gt;&lt;P&gt;However based on the error pattern it mostly look the driver is trying to create a new connection and the DBMS breaking 
the socket between it and the driver, at a very early stage of the 
process. 
This has to be either a network issue, or more likely a DBMS issue like 
if there are too many connection requests at once or in a short burst, 
the DBMS listener process gets overloaded and severs some of the 
incoming
sockets.&lt;/P&gt;&lt;P&gt;- So we will need to check why the connection was not established.  &lt;/P&gt;&lt;P&gt;  - Due to Load on the DB (Check the DB logs of the same timestamp)&lt;/P&gt;&lt;P&gt;  - Due to Load on your machine. ( Check the SAR report to find the historical data of the OS  of the mentioned timestamp)&lt;/P&gt;&lt;P&gt;  - N/W drops also might be a reason.  Check the "/var/log/messages"  to see if any thing unusual happened during the time of error.&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10230/hardikvdesai.html" nodeid="10230"&gt;&lt;/A&gt; &lt;/P&gt;</description>
    <pubDate>Thu, 08 Jun 2017 12:30:36 GMT</pubDate>
    <dc:creator>jsensharma</dc:creator>
    <dc:date>2017-06-08T12:30:36Z</dc:date>
    <item>
      <title>broken pipe error while running a spark job from namenode</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197243#M62343</link>
      <description>&lt;P&gt;While running a spark job i have found the error as mentioned below.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;py4j.protocol.Py4JJavaError: An error occurred while calling o42.load.
: java.sql.SQLRecoverableException: Io exception: Broken pipe&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;This is the file &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/15995-sparkrun.txt"&gt;sparkrun.txt&lt;/A&gt; which i am running as shell from name-node.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/15994-spark-err1.txt"&gt;spark-err1.txt&lt;/A&gt;- this is the error log which i am getting while running the spark job.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/15995-sparkrun.txt"&gt;&lt;/A&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/15996-stest-py.txt"&gt;stest-py.txt&lt;/A&gt; - this is the python file which i have mentioned in my shell file called sparkrun.txt&lt;/P&gt;&lt;P&gt;plz help in this , as i am not able to find any clue in this.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Jun 2017 13:53:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197243#M62343</guid>
      <dc:creator>hardikv_desai</dc:creator>
      <dc:date>2017-06-05T13:53:08Z</dc:date>
    </item>
    <item>
      <title>Re: broken pipe error while running a spark job from namenode</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197244#M62344</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10230/hardikvdesai.html" nodeid="10230"&gt;@hardik desai&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The error seems to be because of logon to oracle DB.&lt;/P&gt;&lt;PRE&gt;df = sqlContext.read.format("jdbc").option("driver", "oracle.jdbc.OracleDriver").option("url","jdbc:oracle:thin:NE/Network_147@10.77.1.147:1521/ELLDEV").option("dbtable","NE.INTER_APP_EVENT").load()

py4j.protocol.Py4JJavaError: An error occurred while calling o42.load.
: java.sql.SQLRecoverableException: Io exception: Broken pipe
        at oracle.jdbc.driver.SQLStateMapping.newSQLException(SQLStateMapping.java:101)
        at oracle.jdbc.driver.DatabaseError.newSQLException(DatabaseError.java:133)
        at oracle.jdbc.driver.DatabaseError.throwSqlException(DatabaseError.java:199)
        at oracle.jdbc.driver.DatabaseError.throwSqlException(DatabaseError.java:263)
        at oracle.jdbc.driver.DatabaseError.throwSqlException(DatabaseError.java:521)
        at oracle.jdbc.driver.T4CConnection.logon(T4CConnection.java:418)
        at oracle.jdbc.driver.PhysicalConnection.&amp;lt;init&amp;gt;(PhysicalConnection.java:508)
        at oracle.jdbc.driver.T4CConnection.&amp;lt;init&amp;gt;(T4CConnection.java:203)&lt;/PRE&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;You should check few things :&lt;/P&gt;&lt;P&gt;1.  Using some sqlplus or oracle client tool are you able to connect to the mentioned URL?&lt;/P&gt;&lt;PRE&gt;jdbc:oracle:thin:NE/Network_147@10.77.1.147:1521/ELLDEV&lt;/PRE&gt;&lt;P&gt;2. Try doing telnet from the machine where you are executing the script to see port access.&lt;/P&gt;&lt;PRE&gt;telnet   10.77.1.147   1521&lt;/PRE&gt;&lt;P&gt;3. Broken Pipe error usually indicates a broken communication. Mostly due to abrupt termination of connection from the other end or N/W issue.&lt;/P&gt;&lt;P&gt;4. Check if the Oracle credentials that you are using are correct not expired/blocked.&lt;/P&gt;&lt;P&gt;5. Can you try passing the credentials as well using "user" and "password" options? Instead of passing it via the URL of the DB&lt;/P&gt;&lt;PRE&gt;options.put("user", "USER").options.put("password", "PASS")&lt;/PRE&gt;&lt;P&gt;.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;In General:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;The driver is trying to create a new connection and the DBMS breaking the socket between it and the driver, at a very early stage of the process. 
This has to be either a network issue, or more likely a DBMS issue like if there are too many connection requests at once or in a short burst, the DBMS listener process gets overloaded and severs some of the incoming
sockets.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Jun 2017 12:15:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197244#M62344</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-06-08T12:15:22Z</dc:date>
    </item>
    <item>
      <title>Re: broken pipe error while running a spark job from namenode</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197245#M62345</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt;, thanks for the reply...&lt;/P&gt;&lt;P&gt;I am able to telnet as you mentioned. Trying to fetch data from oracle DB to HDFS using the script mentioned here. Also note that this is running every hour and i am able to run it successfully in few attempts( 4-5 times a day ) but all the other times its got failed with the erorr given here.&lt;/P&gt;&lt;P&gt;Can you plz provide any more troubleshooting help if possible?&lt;/P&gt;&lt;P&gt;thanks ....&lt;/P&gt;</description>
      <pubDate>Thu, 08 Jun 2017 12:24:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197245#M62345</guid>
      <dc:creator>hardikv_desai</dc:creator>
      <dc:date>2017-06-08T12:24:29Z</dc:date>
    </item>
    <item>
      <title>Re: broken pipe error while running a spark job from namenode</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197246#M62346</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10230/hardikvdesai.html" nodeid="10230"&gt;@hardik desai
&lt;/A&gt;&lt;/P&gt;&lt;P&gt;As the issue is intermittent so this may not be script related issue.&lt;/P&gt;&lt;P&gt;However based on the error pattern it mostly look the driver is trying to create a new connection and the DBMS breaking 
the socket between it and the driver, at a very early stage of the 
process. 
This has to be either a network issue, or more likely a DBMS issue like 
if there are too many connection requests at once or in a short burst, 
the DBMS listener process gets overloaded and severs some of the 
incoming
sockets.&lt;/P&gt;&lt;P&gt;- So we will need to check why the connection was not established.  &lt;/P&gt;&lt;P&gt;  - Due to Load on the DB (Check the DB logs of the same timestamp)&lt;/P&gt;&lt;P&gt;  - Due to Load on your machine. ( Check the SAR report to find the historical data of the OS  of the mentioned timestamp)&lt;/P&gt;&lt;P&gt;  - N/W drops also might be a reason.  Check the "/var/log/messages"  to see if any thing unusual happened during the time of error.&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10230/hardikvdesai.html" nodeid="10230"&gt;&lt;/A&gt; &lt;/P&gt;</description>
      <pubDate>Thu, 08 Jun 2017 12:30:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197246#M62346</guid>
      <dc:creator>jsensharma</dc:creator>
      <dc:date>2017-06-08T12:30:36Z</dc:date>
    </item>
    <item>
      <title>Re: broken pipe error while running a spark job from namenode</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197247#M62347</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/3418/jsensharma.html"&gt;Jay SenSharma&lt;/A&gt; thanks, i will check the logs to find out why it is dropping the connections and let you know...&lt;/P&gt;</description>
      <pubDate>Thu, 08 Jun 2017 12:42:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/broken-pipe-error-while-running-a-spark-job-from-namenode/m-p/197247#M62347</guid>
      <dc:creator>hardikv_desai</dc:creator>
      <dc:date>2017-06-08T12:42:44Z</dc:date>
    </item>
  </channel>
</rss>

