<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Apache Nifi processor to convert 'Control A' (\u0001) separated file to AVRO in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Apache-Nifi-processor-to-convert-Control-A-u0001-separated/m-p/161047#M123432</link>
    <description>&lt;P&gt;How do you know which delimiter is used for a particular file? If you can determine that from the content, you might be able to use RouteOnContent to send all \u0001-delimited files to one ConvertCSVToAvro processor (using the technique I describe above), all \u0002-delimited files to another, and so on. Likewise, if you can somehow extract the delimiter into an attribute, you can use RouteOnAttribute rather than RouteOnContent.&lt;/P&gt;&lt;P&gt;Why do you want to avoid ReplaceText? The content of the flow files is altered when converting to Avro anyway, so you won't have the original input at that point. If the concern is performance, do you think my suggestion above would work for your use case?&lt;/P&gt;</description>
    <pubDate>Thu, 03 Nov 2016 22:08:00 GMT</pubDate>
    <dc:creator>mburgess</dc:creator>
    <dc:date>2016-11-03T22:08:00Z</dc:date>
  </channel>
</rss>

