Created 09-08-2016 03:38 PM
Hi All,
We have upgraded our environment from HDP 2.3.4 to HDP 2.4 and so review Storm topologies in order to use 1.0.1 version and new features. In old version, storm-kafka library (0.9.2) included the option "forceFromStart" in SpoutConfig that restore data from beginning (if set to true) or get only current value (if set to false) by kafka.
In last version the option is removed and I see that the spout get always data from beginning.
How we can replace the functionality in new version? I mean, I want get only the current value from spout, how I can proceed to do it?
Thanks in advance, Giuseppe
Created 09-09-2016 02:13 AM
Storm 1.0.1 is only available in HDP 2.5 so I'm wondering whether you have issues with dependencies. according to the latest documentation for 2.5 forceFeomStart is still there and defaults to false. http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_storm-component-guide/content/storm-kafka...
Created 09-09-2016 02:13 AM
Storm 1.0.1 is only available in HDP 2.5 so I'm wondering whether you have issues with dependencies. according to the latest documentation for 2.5 forceFeomStart is still there and defaults to false. http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/bk_storm-component-guide/content/storm-kafka...
Created 09-09-2016 08:13 AM
Thank you Artem! Yes, I'm working with 2.5, I was wrong to write above...
The dependencies are ok but the KafkaConfig don't have "forceFromStart" properties. In fact, I've checked the KafkaConfig sources and it's missing, please see
So, it could be a refuse in documentation. In this case, how I can use this properties if it is not present in kafkaConfig?
Created 09-09-2016 02:26 PM
I will follow up with engineering on this but for now, can you try the following
kafka.api.OffsetRequest.LatestTime()
that should only grab latest messages
Created 09-09-2016 02:33 PM
looks like you're correct, forceFromStart is removed from Storm in favor of EarliestTime/LatestTime, it is a problem with documentation, I will speak to docs team to remove that and replace with correct info. The associated JIRAs are https://issues.apache.org/jira/browse/STORM-563 and STORM-650
Created 09-09-2016 03:31 PM
Thanks you Artem for clarification, I will proceed using new approach.
Created 09-09-2016 03:55 PM
@Giuseppe Maldarizzi once you confirm it works, please accept the answer to close the thread.
Created 09-09-2016 03:57 PM
Yes, it work, we can close the thread. Thanks again
Created 09-09-2016 05:25 PM
@Giuseppe Maldarizzi just heard back from engineering, also please look at ignoreZKOffsets parameter in place of forceFromStart, documentation will be updated. https://github.com/apache/storm/tree/master/external/storm-kafka#how-kafkaspout-stores-offsets-of-a-...
Created 09-12-2016 07:44 AM
Perfect, I will try it. Thank you again