<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Storm - missing messages in pipeline in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162510#M57352</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14221/ambudsharma.html" nodeid="14221"&gt;@Ambud Sharma&lt;/A&gt;Yes. There is a case where the message from Bolt 2 doesn't get written but from bolt3 should get written. But if Bolt2 output is written, Bolt 3 output should always be there. vice versa is not true. Is that a problem? &lt;/P&gt;&lt;P&gt;We are not anchoring tuples. We are extending BaseBasicBolt and from I understand we need to anchor tuples only if we extend BaseRichBolt..Is that incorrect?&lt;/P&gt;&lt;P&gt;No, we are not doing any microbatching.&lt;/P&gt;</description>
    <pubDate>Tue, 21 Mar 2017 22:21:16 GMT</pubDate>
    <dc:creator>dlaxmi1234</dc:creator>
    <dc:date>2017-03-21T22:21:16Z</dc:date>
    <item>
      <title>Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162508#M57350</link>
      <description>&lt;P&gt;Hi all..We are noticing that there are some messages 
which get lost during storm processing..below is a brief outline of our 
pipeline.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="13712-smouw.png" style="width: 2268px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20575i0337B8855C03FACC/image-size/medium?v=v2&amp;amp;px=400" role="button" title="13712-smouw.png" alt="13712-smouw.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;IMG alt="" /&gt;&lt;/P&gt;&lt;P&gt;&lt;IMG alt="" /&gt;&lt;/P&gt;&lt;P&gt;We have messages coming to Kafka which then get consumed by 2 
different kafka spouts in Storm. One Spout writes the message to raw 
stream and other storm starts processing the message. We need to store 
the output of Bolt2 to HDFS and also send it down for further processing
 which will then eventually end up in ADLS as well.&lt;/P&gt;&lt;P&gt;All the 3 HDFS bolts are configured to write to different folder 
structures in ADLS. In an ideal scenario I should see all the 3 messages
 in ADLS ( raw, out of bolt2 and output of bolt3). But we are noticing 
that raw gets written always but sometimes only one of the output (bolt2
 or bolt3) gets written to ADLS. Its inconsistent on which one misses. 
Sometimes both get written. There aren't any errors/exceptions in log messages. &lt;/P&gt;&lt;P&gt;Did anyone run into such issues? Any insight will be appreciated. Are there any good monitoring tools other than Storm UI that gives insight into what is going on? We 
are using HDInsight and are hosted on Azure and are using Storm 1.0.1&lt;/P&gt;&lt;P&gt;Thanks. &lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 11:41:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162508#M57350</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2019-08-18T11:41:47Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162509#M57351</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16691/dlaxmi1234.html" nodeid="16691"&gt;@Laxmi Chary&lt;/A&gt; thanks for your question. 

Do you know if there's ever a case where Message from Bolt 2 doesn't get written but from Bolt 3 does get written?

Are you anchoring tuples in your topology? collector.emit(tuple, new Field())  [the tuple is the anchor]&lt;/P&gt;&lt;P&gt;Are you doing any microbatching in your topology?&lt;/P&gt;</description>
      <pubDate>Sat, 18 Mar 2017 01:51:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162509#M57351</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2017-03-18T01:51:00Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162510#M57352</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14221/ambudsharma.html" nodeid="14221"&gt;@Ambud Sharma&lt;/A&gt;Yes. There is a case where the message from Bolt 2 doesn't get written but from bolt3 should get written. But if Bolt2 output is written, Bolt 3 output should always be there. vice versa is not true. Is that a problem? &lt;/P&gt;&lt;P&gt;We are not anchoring tuples. We are extending BaseBasicBolt and from I understand we need to anchor tuples only if we extend BaseRichBolt..Is that incorrect?&lt;/P&gt;&lt;P&gt;No, we are not doing any microbatching.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Mar 2017 22:21:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162510#M57352</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-21T22:21:16Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162511#M57353</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14221/ambudsharma.html" nodeid="14221"&gt;@Ambud Sharma&lt;/A&gt; wondering if u have more insight. Let me know if you need more details. TIA&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2017 22:40:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162511#M57353</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-22T22:40:39Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162512#M57354</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/16691/dlaxmi1234.html" nodeid="16691"&gt;@Laxmi Chary&lt;/A&gt;&lt;P&gt;You should be anchoring, without anchoring Storm doesn't guarantee at least once semantics which means it's best effort.&lt;/P&gt;&lt;P&gt;Anchoring is a factor of your delivery semantics, you should be using BaseRichBolt, otherwise you don't have a collector.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 01:33:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162512#M57354</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2017-03-23T01:33:04Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162513#M57355</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14221/ambudsharma.html" nodeid="14221"&gt;@Ambud Sharma&lt;/A&gt; Doesn't the BaseBasicBolt do that for u?&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 01:57:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162513#M57355</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-23T01:57:57Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162514#M57356</link>
      <description>&lt;P&gt;This is what was mentioned in Storm Applied book &lt;/P&gt;&lt;P&gt;"The beauty of using BaseBasicBolt as our base class is that it automatically provides anchoring and acking for us." and we are using BaseBasicBolt. Are you saying that this is incorrect?&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 02:00:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162514#M57356</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-23T02:00:04Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162515#M57357</link>
      <description>&lt;P&gt;Yes, that is incorrect, &lt;A href="https://github.com/apache/storm/blob/master/storm-core/src/jvm/org/apache/storm/topology/base/BaseBasicBolt.java" target="_blank"&gt;https://github.com/apache/storm/blob/master/storm-core/src/jvm/org/apache/storm/topology/base/BaseBasicBolt.java&lt;/A&gt; this bolt class doesn't even have a collector to acknowledge messages.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 02:06:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162515#M57357</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2017-03-23T02:06:26Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162516#M57358</link>
      <description>&lt;P&gt;ok. Do you know if there is any documentation on this?&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 02:11:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162516#M57358</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-23T02:11:26Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162517#M57359</link>
      <description>&lt;P&gt;Here's some example code to show you how explicit anchoring and acking can be done: 
&lt;A href="https://github.com/Symantec/hendrix/blob/current/hendrix-storm/src/main/java/io/symcpe/hendrix/storm/bolts/ErrorBolt.java" target="_blank"&gt;https://github.com/Symantec/hendrix/blob/current/hendrix-storm/src/main/java/io/symcpe/hendrix/storm/bolts/ErrorBolt.java&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2017 02:12:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162517#M57359</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2017-03-23T02:12:10Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162518#M57360</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/16691/dlaxmi1234.html" nodeid="16691"&gt;@Shravanthi&lt;/A&gt; please accept the answer if this solved your issue.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Mar 2017 03:30:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162518#M57360</guid>
      <dc:creator>ambud_sharma1</dc:creator>
      <dc:date>2017-03-29T03:30:35Z</dc:date>
    </item>
    <item>
      <title>Re: Storm - missing messages in pipeline</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162519#M57361</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/14221/ambudsharma.html" nodeid="14221"&gt;@Ambud Sharma&lt;/A&gt; we are testing this change and will accept once we are done. I am still not 100% convinced that this solves the problem since the Storm documentation says BasicBolt does the acking and anchoring &lt;A href="http://storm.apache.org/releases/1.0.1/Guaranteeing-message-processing.html"&gt;http://storm.apache.org/releases/1.0.1/Guaranteeing-message-processing.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Search for BasicBolt in that link and you will find "Storm has an interface called &lt;CODE&gt;BasicBolt&lt;/CODE&gt; that encapsulates this pattern for you."&lt;/P&gt;</description>
      <pubDate>Wed, 29 Mar 2017 10:07:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Storm-missing-messages-in-pipeline/m-p/162519#M57361</guid>
      <dc:creator>dlaxmi1234</dc:creator>
      <dc:date>2017-03-29T10:07:54Z</dc:date>
    </item>
  </channel>
</rss>

