<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Premature EOF: Error while reading and writing data in HDFS in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Premature-EOF-Error-while-reading-and-writing-data-in-HDFS/m-p/269325#M206768</link>
    <description>&lt;P&gt;Ambari 2.6 and HDP 2.6.3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The error appears while performing the following operations:&lt;BR /&gt;1) HDFS get operation.&lt;BR /&gt;2) Aggregating and writing a file to HDFS using PySpark.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Error:&lt;/STRONG&gt; &lt;FONT color="#FF0000"&gt;"19/08/29 15:53:02 WARN hdfs.DFSClient: Failed to connect to /DN_IP:1019 for block, add to deadNodes and continue. java.io.EOFException: Premature EOF: no length prefix available "&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We found the following links, which suggest setting &lt;STRONG&gt;dfs.datanode.max.transfer.threads=8196&lt;/STRONG&gt; to resolve the above error:&lt;BR /&gt;&lt;FONT color="#00CCFF"&gt;1) &lt;A href="https://www.netiq.com/documentation/sentinel-82/admin/data/b1nbq4if.html" target="_blank"&gt;https://www.netiq.com/documentation/sentinel-82/admin/data/b1nbq4if.html&lt;/A&gt; (Performance Tuning Guidelines)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#00CCFF"&gt;2) &lt;A href="https://github.com/hortonworks/structor/issues/7" target="_blank"&gt;https://github.com/hortonworks/structor/issues/7&lt;/A&gt; (jmaron commented on Jul 28, 2014)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could you please advise whether I should go ahead with this resolution?&lt;/P&gt;&lt;P&gt;Does this configuration change affect any other services?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
    <pubDate>Thu, 29 Aug 2019 11:20:52 GMT</pubDate>
    <dc:creator>rohit_r_sharma</dc:creator>
    <dc:date>2019-08-29T11:20:52Z</dc:date>
    <item>
      <title>Premature EOF: Error while reading and writing data in HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Premature-EOF-Error-while-reading-and-writing-data-in-HDFS/m-p/269325#M206768</link>
      <description>&lt;P&gt;Ambari 2.6 and HDP 2.6.3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The error appears while performing the following operations:&lt;BR /&gt;1) HDFS get operation.&lt;BR /&gt;2) Aggregating and writing a file to HDFS using PySpark.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Error:&lt;/STRONG&gt; &lt;FONT color="#FF0000"&gt;"19/08/29 15:53:02 WARN hdfs.DFSClient: Failed to connect to /DN_IP:1019 for block, add to deadNodes and continue. java.io.EOFException: Premature EOF: no length prefix available "&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We found the following links, which suggest setting &lt;STRONG&gt;dfs.datanode.max.transfer.threads=8196&lt;/STRONG&gt; to resolve the above error:&lt;BR /&gt;&lt;FONT color="#00CCFF"&gt;1) &lt;A href="https://www.netiq.com/documentation/sentinel-82/admin/data/b1nbq4if.html" target="_blank"&gt;https://www.netiq.com/documentation/sentinel-82/admin/data/b1nbq4if.html&lt;/A&gt; (Performance Tuning Guidelines)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT color="#00CCFF"&gt;2) &lt;A href="https://github.com/hortonworks/structor/issues/7" target="_blank"&gt;https://github.com/hortonworks/structor/issues/7&lt;/A&gt; (jmaron commented on Jul 28, 2014)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could you please advise whether I should go ahead with this resolution?&lt;/P&gt;&lt;P&gt;Does this configuration change affect any other services?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
      <pubDate>Thu, 29 Aug 2019 11:20:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Premature-EOF-Error-while-reading-and-writing-data-in-HDFS/m-p/269325#M206768</guid>
      <dc:creator>rohit_r_sharma</dc:creator>
      <dc:date>2019-08-29T11:20:52Z</dc:date>
    </item>
    <item>
      <title>Re: Premature EOF: Error while reading and writing data in HDFS</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Premature-EOF-Error-while-reading-and-writing-data-in-HDFS/m-p/269824#M207112</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Resolved using the steps below:&lt;/P&gt;&lt;P&gt;1) To observe the DataNode threads:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Created a widget in Ambari under HDFS for DataNode Threads (Runnable, Waited, Blocked)&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="DataNode_threads.png" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/24307iB37CCEE9646BD040/image-size/large?v=v2&amp;amp;px=999" role="button" title="DataNode_threads.png" alt="DataNode_threads.png" /&gt;&lt;/span&gt;&lt;/LI&gt;&lt;LI&gt;Observed that from a particular date the threads went into the wait state.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="DataNode_threads_Graph.png" style="width: 819px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/24308i963EAF1AD81F9FCB/image-size/large?v=v2&amp;amp;px=999" role="button" title="DataNode_threads_Graph.png" alt="DataNode_threads_Graph.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Exported the graph widget CSV file to view the exact time at which the threads began waiting.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;2)&lt;STRONG&gt; Restarted all DataNodes manually and observed that the waiting threads were released.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;3) With the default 4096 threads, the DataNode is running properly.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="DataNode_threads_Graph_resolved.png" style="width: 819px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/24309i31CCFE43C0A514CF/image-size/large?v=v2&amp;amp;px=999" role="button" title="DataNode_threads_Graph_resolved.png" alt="DataNode_threads_Graph_resolved.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;Still unable to understand:&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1) How can we check which DataNode the waiting threads are on?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2) Which task or process causes threads to go into the wait state?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would like to know whether anyone has come across this and was able to investigate it in detail.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Otherwise, the above steps are the only solution for waiting threads.&lt;/P&gt;</description>
      <pubDate>Thu, 05 Sep 2019 11:46:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Premature-EOF-Error-while-reading-and-writing-data-in-HDFS/m-p/269824#M207112</guid>
      <dc:creator>rohit_r_sharma</dc:creator>
      <dc:date>2019-09-05T11:46:01Z</dc:date>
    </item>
  </channel>
</rss>

