Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

What is the limit for Maximum number of characters in a single event for Flume-1.8 ?

Explorer

I'm running Flume-1.8 with the spoolDir as flume source and hdfs as a sink. I need to process log files in spoolDir with the lines containing more than 10,000 characters and guarantee the file in the hdfs will contain exactly the same number of lines.

According to the Flume-1.8 User Guide the DEFAULT Maximum number of characters to include in a single event is 2048. If a line exceeds this length, it is truncated, and the remaining characters on the line will appear in a subsequent event.

I need to set this value of deserializer.maxLineLength as bigger as possible to prevent input file's lines truncating.

Is there any limit for the value of deserializer.maxLineLength in Flume-1.8 ?

Thanks

1 REPLY 1

Hello @Vadim Dzyuban!
I had never set this parameter when I had to use flume, but, looking at the code here, seems to be limited to the bounds of an INTEGER (I didn't test the limits for this parameters).

Below the class used by Deserializer, and indeed the default value it's 2048.
https://github.com/apache/flume/blob/flume-1.8/flume-ng-core/src/main/java/org/apache/flume/serializ...
And the context class get the parameter of your config file
https://github.com/apache/flume/blob/flume-1.8/flume-ng-configuration/src/main/java/org/apache/flume...

And declaring this parameter would be like:

agent.sources.<NAME>.deserializer.maxLineLength=#NUMBER

Hope this helps

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.