<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Flume + HDFS IO error + ConnectException in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/28157#M6165</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;I'm working with Cloudera Manager on CDH 5.4.2 with Flume installed, and I cannot save the data that I get from Twitter.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;When I run the Flume agent, it starts fine but fails when it attempts to write new event data into HDFS.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I get the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;INFO org.apache.flume.sink.hdfs.BucketWriter: Creating hdfs://192.168.109.6:8020/user/flume/tweets/2015/06/03/06//FlumeData.1433311217583.tmp&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;WARN org.apache.flume.sink.hdfs.HDFSEventSink: HDFS IO error&lt;BR /&gt;java.net.ConnectException: Call From cluster-05.xxxx.com/192.168.109.6 to cluster-05.xxxx.com:8020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: &lt;A href="http://wiki.apache.org/hadoop/ConnectionRefused" target="_blank"&gt;http://wiki.apache.org/hadoop/ConnectionRefused&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My configuration is:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;flume-conf.property:&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;TwitterAgent.sinks.HDFS.channel = MemChannel&lt;BR /&gt;TwitterAgent.sinks.HDFS.type = hdfs&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.path = hdfs://192.168.109.6:8020/user/flume/tweets/%Y/%m/%d/%H/&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.batchSize = 1000&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.rollSize = 0&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.rollCount = 10000&lt;/P&gt;&lt;P&gt;I am using the following plugins:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;flume-sources-1.0-SNAPSHOT.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-core-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-media-support-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-stream-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;(I replaced the twitter4j-*-3.0.3.jar files with the twitter4j-*-2.2.6.jar versions.)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the directory listing, run as the hdfs user:&lt;/P&gt;&lt;P&gt;hadoop fs -ls /user/flume :&lt;/P&gt;&lt;P&gt;drwxrwxrwx - flume flume &amp;nbsp;/user/flume/tweets&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;In core-site.xml (at /hadoop/conf) I added:&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;lt;property&amp;gt;&lt;BR /&gt;&amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;&lt;BR /&gt;&amp;lt;value&amp;gt;hdfs://localhost:8020&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also ran&amp;nbsp;&lt;EM&gt;hadoop dfsadmin -safemode leave&lt;/EM&gt; as the &lt;EM&gt;HDFS&lt;/EM&gt; user on the host where the Flume agent runs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would really appreciate your help with this issue.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;AR&lt;/P&gt;</description>
    <pubDate>Wed, 03 Jun 2015 07:17:31 GMT</pubDate>
    <dc:creator>anton_85</dc:creator>
    <dc:date>2015-06-03T07:17:31Z</dc:date>
    <item>
      <title>Flume + HDFS IO error + ConnectException</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/28157#M6165</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;I'm working with Cloudera Manager on CDH 5.4.2 with Flume installed, and I cannot save the data that I get from Twitter.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;When I run the Flume agent, it starts fine but fails when it attempts to write new event data into HDFS.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I get the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;INFO org.apache.flume.sink.hdfs.BucketWriter: Creating hdfs://192.168.109.6:8020/user/flume/tweets/2015/06/03/06//FlumeData.1433311217583.tmp&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;WARN org.apache.flume.sink.hdfs.HDFSEventSink: HDFS IO error&lt;BR /&gt;java.net.ConnectException: Call From cluster-05.xxxx.com/192.168.109.6 to cluster-05.xxxx.com:8020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: &lt;A href="http://wiki.apache.org/hadoop/ConnectionRefused" target="_blank"&gt;http://wiki.apache.org/hadoop/ConnectionRefused&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My configuration is:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;flume-conf.property:&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;TwitterAgent.sinks.HDFS.channel = MemChannel&lt;BR /&gt;TwitterAgent.sinks.HDFS.type = hdfs&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.path = hdfs://192.168.109.6:8020/user/flume/tweets/%Y/%m/%d/%H/&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.batchSize = 1000&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.rollSize = 0&lt;BR /&gt;TwitterAgent.sinks.HDFS.hdfs.rollCount = 10000&lt;/P&gt;&lt;P&gt;I am using the following plugins:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;flume-sources-1.0-SNAPSHOT.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-core-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-media-support-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;twitter4j-stream-2.2.6.jar&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;(I replaced the twitter4j-*-3.0.3.jar files with the twitter4j-*-2.2.6.jar versions.)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Here is the directory listing, run as the hdfs user:&lt;/P&gt;&lt;P&gt;hadoop fs -ls /user/flume :&lt;/P&gt;&lt;P&gt;drwxrwxrwx - flume flume &amp;nbsp;/user/flume/tweets&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;In core-site.xml (at /hadoop/conf) I added:&lt;/U&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;lt;property&amp;gt;&lt;BR /&gt;&amp;lt;name&amp;gt;fs.default.name&amp;lt;/name&amp;gt;&lt;BR /&gt;&amp;lt;value&amp;gt;hdfs://localhost:8020&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also ran&amp;nbsp;&lt;EM&gt;hadoop dfsadmin -safemode leave&lt;/EM&gt; as the &lt;EM&gt;HDFS&lt;/EM&gt; user on the host where the Flume agent runs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would really appreciate your help with this issue.&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;AR&lt;/P&gt;</description>
      <pubDate>Wed, 03 Jun 2015 07:17:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/28157#M6165</guid>
      <dc:creator>anton_85</dc:creator>
      <dc:date>2015-06-03T07:17:31Z</dc:date>
    </item>
    <item>
      <title>Re: Flume + HDFS IO error + ConnectException</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/28190#M6166</link>
      <description>&lt;P&gt;I found the solution myself, and I'm leaving it here in case anyone has the same error.&lt;/P&gt;&lt;P&gt;My mistake was that, because I was running in a cluster, the sink has to point at the Hadoop host. So I changed the address here:&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;TwitterAgent.sinks.HDFS.hdfs.path = hdfs://&lt;U&gt;&lt;EM&gt;192.168.109.6&lt;/EM&gt;&lt;/U&gt;:8020/user/flume/tweets/%Y/%m/%d/%H/&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;After that, everything ran smoothly.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Jun 2015 05:07:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/28190#M6166</guid>
      <dc:creator>anton_85</dc:creator>
      <dc:date>2015-06-04T05:07:00Z</dc:date>
    </item>
    <item>
      <title>Re: Flume + HDFS IO error + ConnectException</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/39154#M6167</link>
      <description>&lt;P&gt;Hi, as you explained above, you changed an address to solve the HDFS IO error, but I do not see any change in the address given in your solution. Can you explain clearly what you did to solve the error?&lt;/P&gt;</description>
      <pubDate>Wed, 30 Mar 2016 05:39:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/39154#M6167</guid>
      <dc:creator>Tejaponnaluru</dc:creator>
      <dc:date>2016-03-30T05:39:31Z</dc:date>
    </item>
    <item>
      <title>Re: Flume + HDFS IO error + ConnectException</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/57949#M6168</link>
      <description>&lt;P&gt;In my case the problem was an incorrect port number. I made sure to use the NameNode port.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jul 2017 22:06:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Flume-HDFS-IO-error-ConnectException/m-p/57949#M6168</guid>
      <dc:creator>tmndungu</dc:creator>
      <dc:date>2017-07-24T22:06:30Z</dc:date>
    </item>
  </channel>
</rss>

