<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: PutKudu processor doesn't throw any exceptions and doesn't write data to Kudu in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/PutKudu-processor-doesn-t-throw-any-exceptions-and-doesn-t/m-p/207905#M169866</link>
    <description>&lt;P&gt;First of all, your article is great! It was very useful on my first steps with NiFi and Kudu. Thanks!&lt;/P&gt;&lt;P&gt;My dataflow is:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="43675-dataflow.png" style="width: 1523px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16928i426D3575811B41CA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="43675-dataflow.png" alt="43675-dataflow.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I get data from REST API (for now it stopped, so you can see red square, but it works fine when i get data from it), then add needed attributes to dataflow (like timestamp, API's title), then split JSON array into JSON objests (no nested objects there, "key":"value" pairs only), then remove several fields via JOLT transformation (with symbols like '@'), then add dataflow attributes to JSON. For now i have flow contains JSON oblects with several "key":"value" data into each object. Keys like "field", "field1" or "field_name1". Values are numbers, strings(GUIDs or dates in different formats) or nulls. I checked JSON by stopping PutKudu and view data into queue.&lt;/P&gt;&lt;P&gt;My PutKudu configuration is:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="43679-putkudu-configuration.png" style="width: 787px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16929iCA7064FFF355631B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="43679-putkudu-configuration.png" alt="43679-putkudu-configuration.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I tried to use several Flush Mode values (AUTO_FLUSH_SYNC, AUTO_FLUSH_BACKGROUND), different Batch size values (2, 10, 100, 1000, 10000) and different Record Readers (AvroReader, several JsonPathReaders).&lt;/P&gt;&lt;P&gt;For now I extract 1 field from JSON object via JsonPathReader and store it into Kudu table with 1 column. JsonPathReader configuration is:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="43682-jsonpathreader-configuration.png" style="width: 779px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16930iC2CEC5829D75DB57/image-size/medium?v=v2&amp;amp;px=400" role="button" title="43682-jsonpathreader-configuration.png" alt="43682-jsonpathreader-configuration.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I checked that key "fieldid" exists in JSON and its value is GUID string. Table "impala::default.test_activities" with 1 string column "json_str" created via Impala. I added 1 test row to it. Also i checked existing of table with "kudu table list localhost:7051" command on kudu_master.&lt;/P&gt;&lt;P&gt;AvroSchemaRegistry configuration is:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="43683-avroschemaregistry-configuration.png" style="width: 770px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/16931i805D46BA88720D9C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="43683-avroschemaregistry-configuration.png" alt="43683-avroschemaregistry-configuration.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;and "All" schema is&lt;/P&gt;&lt;PRE&gt;{
  "type": "record",
  "name": "All",
  "fields": [
  {
    "name" : "json_str",
    "type" : "string"
  }]
}&lt;/PRE&gt;&lt;P&gt;Also i tried to use schema like this:&lt;/P&gt;&lt;PRE&gt;{
  "type": "record",
  "name": "All",
  "fields": [
  {
    "name" : "json_str",
    "type" : ["string","null"]
  }]
}&lt;/PRE&gt;&lt;P&gt;That what i have for now.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Sun, 18 Aug 2019 04:30:37 GMT</pubDate>
    <dc:creator>dmitkoh</dc:creator>
    <dc:date>2019-08-18T04:30:37Z</dc:date>
  </channel>
</rss>

