Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

What is the limit for Maximum number of characters in a single event for Flume-1.8 ?

Highlighted

What is the limit for Maximum number of characters in a single event for Flume-1.8 ?

New Contributor

I'm running Flume-1.8 with the spoolDir as flume source and hdfs as a sink. I need to process log files in spoolDir with the lines containing more than 10,000 characters and guarantee the file in the hdfs will contain exactly the same number of lines.

According to the Flume-1.8 User Guide the DEFAULT Maximum number of characters to include in a single event is 2048. If a line exceeds this length, it is truncated, and the remaining characters on the line will appear in a subsequent event.

I need to set this value of deserializer.maxLineLength as bigger as possible to prevent input file's lines truncating.

Is there any limit for the value of deserializer.maxLineLength in Flume-1.8 ?

Thanks

1 REPLY 1

Re: What is the limit for Maximum number of characters in a single event for Flume-1.8 ?

Hello @Vadim Dzyuban!
I had never set this parameter when I had to use flume, but, looking at the code here, seems to be limited to the bounds of an INTEGER (I didn't test the limits for this parameters).

Below the class used by Deserializer, and indeed the default value it's 2048.
https://github.com/apache/flume/blob/flume-1.8/flume-ng-core/src/main/java/org/apache/flume/serializ...
And the context class get the parameter of your config file
https://github.com/apache/flume/blob/flume-1.8/flume-ng-configuration/src/main/java/org/apache/flume...

And declaring this parameter would be like:

agent.sources.<NAME>.deserializer.maxLineLength=#NUMBER

Hope this helps