<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Nifi InferAVROSchema in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143456#M48323</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11016/avijeetd.html" nodeid="11016"&gt;@Avijeet Dash&lt;/A&gt;  Take a look at this template for some examples.&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11016/avijeetd.html" nodeid="11016"&gt;&lt;/A&gt; &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/10186-avroschemascenarios.xml"&gt;avroschemascenarios.xml&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 08 Dec 2016 23:17:02 GMT</pubDate>
    <dc:creator>bbende</dc:creator>
    <dc:date>2016-12-08T23:17:02Z</dc:date>
    <item>
      <title>Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143452#M48319</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;How does Infer schema works in the flow, does it keep inferring for every single dataflow, is that a good approach? shoudn't we use convertCSVToAVRO by providing a avsc file created by Kite. &lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;Avijeet&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2016 17:04:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143452#M48319</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2016-12-08T17:04:14Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143453#M48320</link>
      <description>&lt;P&gt;Each CSV or JSON that comes in the InferAvroSchema could be different so it will infer the schema for each flow file and put the schema where you specify the schema destination, either flow file content or a flow file attribute. Then you can use that attribute in ConvertCsvToAvro as the schema by referencing ${inferred.avro.schema}.&lt;/P&gt;&lt;P&gt;If you are sending only one type of CSV in to ConvertCsvToAvro then it would be more efficient for you to define the Avro schema you want and not use InferAvroSchema.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2016 22:25:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143453#M48320</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2016-12-08T22:25:11Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143454#M48321</link>
      <description>&lt;P&gt;HI &lt;A rel="user" href="https://community.cloudera.com/users/363/bbende.html" nodeid="363"&gt;@Bryan Bende&lt;/A&gt;, Thanks.&lt;/P&gt;&lt;P&gt;will it not be the case when a stream contains messages for one particular schema, I noticed KAFKA is trying to implement something similar, putting a Inferschema in a dataflow seems like a dangerous thing to do.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2016 22:39:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143454#M48321</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2016-12-08T22:39:12Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143455#M48322</link>
      <description>&lt;P&gt;It depends how you construct your dataflow in NiFi... You could set it up so that you have several logical streams that each have their own ConvertCsvToAvro processor, or you could have several processors feeding into the same ConvertCsvToAvro processor.&lt;/P&gt;&lt;P&gt;Kafka itself does not enforce anything related to a schema, but Confluent has a schema registry with serializers and deserializers and they can enforce that any message being written to a topic must conform to the schema for that topic.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2016 23:01:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143455#M48322</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2016-12-08T23:01:08Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143456#M48323</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11016/avijeetd.html" nodeid="11016"&gt;@Avijeet Dash&lt;/A&gt;  Take a look at this template for some examples.&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11016/avijeetd.html" nodeid="11016"&gt;&lt;/A&gt; &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/10186-avroschemascenarios.xml"&gt;avroschemascenarios.xml&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2016 23:17:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143456#M48323</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2016-12-08T23:17:02Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143457#M48324</link>
      <description>&lt;P&gt;I have been using InferAvroSchema in dataflows for a while and:&lt;/P&gt;&lt;P&gt;1. It infers the schema for each file on input&lt;/P&gt;&lt;P&gt;2. saves the schema into ${inferred.avro.schema} attribute for that flowfile&lt;/P&gt;&lt;P&gt;3. it is not good for production use&lt;/P&gt;&lt;P&gt;As schema inferrence is only a guess, I would recommend you to infer your schema once (double check manually for correctness) and then use it as a static schema in ConvertAvroTo... processors (prepend RouteOnAttribute if you need different schemas). In production, this is what you want. Sometimes, the data can be misleading for inferrence. For example, I have input CSV with empty column, which in fact is nullable long column. Schema inferrence cannot guess it is nullable long. So for one input file, where the values are filled in as numbers, it guesses long type, and for another, where the column is empty, it guesses nullable string...&lt;/P&gt;</description>
      <pubDate>Fri, 09 Dec 2016 16:02:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143457#M48324</guid>
      <dc:creator>michal_klempa</dc:creator>
      <dc:date>2016-12-09T16:02:57Z</dc:date>
    </item>
    <item>
      <title>Re: Nifi InferAVROSchema</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143458#M48325</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/3603/michalklempa.html" nodeid="3603"&gt;@Michal Klempa&lt;/A&gt; I agree. Thanks.&lt;/P&gt;</description>
      <pubDate>Mon, 12 Dec 2016 13:04:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Nifi-InferAVROSchema/m-p/143458#M48325</guid>
      <dc:creator>avijeetd</dc:creator>
      <dc:date>2016-12-12T13:04:59Z</dc:date>
    </item>
  </channel>
</rss>

