Support Questions

Jonathan_M_Maes · ‎01-13-2017

Storm Version: 0.10.0.2.4

Using a Kafka Spout.

How does storm handle failed tuples?

How many times will storm retry a failed tuple?

What frequency will storm retry the failed tuple?

What is the max tuple count a topology can handle between all spouts and bolts?

ambud_sharma1 · ‎01-14-2017

Hi @Jon Maestas

Answering your questions inline:

How does storm handle failed tuples?

When you are using at least once processing (acking and anchoring) is when Storm will handle tuple failures by retries. Retry means re-emitting a tuple from Spout.

How many times will storm retry a failed tuple?

This depends on the Spout's logic, in case of Kafka Spout for 0.10.x Storm there's the ability for exponential backoff retry (https://github.com/apache/storm/blob/0.10.x-branch/external/storm-kafka/src/jvm/storm/kafka/ExponentialBackoffMsgRetryManager.java)

What frequency will storm retry the failed tuple?

ExponentialBackoff will determine the frequency.

What is the max tuple count a topology can handle between all spouts and bolts?

I am guessing you are asking for maximum number of tuples at any given point can be in Storm's buffers? This = Bolt Count * Executor Count * TOPOLOGY_EXECUTOR_RECEIVE_BUFFER_SIZE + Bolt Count * Executor Count * TOPOLOGY_EXECUTOR_SEND_BUFFER_SIZE

You can find out the value of these buffers from Ambari -> Storm -> Config -> Search "buffer"

Please note that the above is theoretical maximum, Max Spout Pending (topology.max.spout.pending) throttles the number of in-flight tuples from the Spout.

There's also transfer buffers which will add bit more to the above calculated number.

Please refer to Michael Noll's blog for more details about Storm Buffers (http://www.michael-noll.com/blog/2013/06/21/understanding-storm-internal-message-buffers/)

Hope this answers your questions.

View solution in original post

ambud_sharma1 · ‎01-14-2017