Member since: 07-19-2018
Posts: 613
Kudos Received: 101
Solutions: 117
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 5096 | 01-11-2021 05:54 AM |
| | 3422 | 01-11-2021 05:52 AM |
| | 8789 | 01-08-2021 05:23 AM |
| | 8385 | 01-04-2021 04:08 AM |
| | 36689 | 12-18-2020 05:42 AM |
05-18-2020
05:34 AM
@arunnalpet You should be able to just add the $. If that doesn't work, you may have to escape it: \$
05-18-2020
04:55 AM
1 Kudo
Great answer, many thanks. For the record, I had the data in a FlowFile. I needed a bit of regex magic to remove the quotation marks after unescaping.
05-16-2020
05:33 AM
@johndcal A namespace is not required within the Avro schema source in Schema Registry, in the context of the Avro spec.

In order to create an Avro schema in the Schema Registry, you have to send a first call to create the schema entity. The next call then adds the actual Avro schema to the existing entity. This is just the behavior of the Schema Registry.

You can find some lessons I created on how to use the registry: https://community.cloudera.com/t5/Community-Articles/Using-the-Schema-Registry-API/ta-p/286194

I also have an article showing how to fully automate the creation of Avro schemas from a CSV file (column name and data type) using the Schema Registry, Hive, and NiFi: https://community.cloudera.com/t5/Community-Articles/How-to-automate-creation-of-Avro-and-Hive-Schemas-using-NiFi/ta-p/293183

If this answer resolves your issue or allows you to move forward, please choose to ACCEPT this solution and close this topic. If you have further dialogue on this topic, please comment here or feel free to private message me. If you have new questions related to your use case, please create a separate topic and feel free to tag me in your post.

Thanks, Steven @ DFHZ
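For anyone following along, here is a minimal sketch of the two-call sequence described above. It only builds the requests and does not send them. The host, port, group name, and the endpoint paths (`/schemas` and `/schemas/{name}/versions`) are assumptions based on a typical Schema Registry install, so check them against your own version before using them.

```python
import json
import urllib.request

# Assumed base URL: host, port, and API prefix are placeholders for your registry.
BASE_URL = "http://registry-host:7788/api/v1/schemaregistry"


def build_requests(name, avro_schema, base_url=BASE_URL):
    """Return the two POST requests: create the entity, then add a schema version."""
    headers = {"Content-Type": "application/json"}

    # Call 1: create the schema entity (metadata only, no Avro text yet).
    entity = {
        "type": "avro",
        "schemaGroup": "demo",  # hypothetical group name
        "name": name,
        "description": name,
        "compatibility": "BACKWARD",
        "validationLevel": "ALL",
    }
    create = urllib.request.Request(
        f"{base_url}/schemas",
        data=json.dumps(entity).encode(),
        headers=headers,
        method="POST",
    )

    # Call 2: attach the actual Avro schema text to the existing entity.
    version = {"description": "initial version", "schemaText": json.dumps(avro_schema)}
    add_version = urllib.request.Request(
        f"{base_url}/schemas/{name}/versions",
        data=json.dumps(version).encode(),
        headers=headers,
        method="POST",
    )
    return create, add_version


# An Avro record schema without a "namespace" field.
schema = {"type": "record", "name": "demo", "fields": [{"name": "id", "type": "int"}]}
create_req, version_req = build_requests("demo", schema)
# To send them in order: urllib.request.urlopen(create_req), then urlopen(version_req).
```

The order matters: the version call refers to the entity by name, so it can only succeed after the first call has created that entity.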
05-16-2020
05:21 AM
@Genentech I am not sure if this is the answer you are looking for, but my recommendation is to leave your original table as is and select results from it into the Parquet table. I am a firm believer in using backup copies, staging copies, or temporary copies of original data sources on the path through translation to the final source.

Make a new empty table with the Parquet format you want. The format must match. Next execute:

INSERT INTO final_table SELECT * FROM source_table;

If you need to retain the same original table name, you can alter or drop the original table and execute a rename statement on the final_table above.

Thanks, Steven @ DFHZ
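As a rough sketch of the whole sequence, the helper below just assembles the HiveQL statements in order so you can review them before running anything. The table names, the column list, and the `_backup` suffix are hypothetical, and keeping the original as a backup (rather than dropping it) is only one of the two options mentioned above.

```python
def parquet_migration_statements(source, final, columns, keep_original_name=False):
    """Build the HiveQL statements for copying a table into a new Parquet table.

    columns is a list of (name, type) pairs that must match the source table.
    """
    column_defs = ", ".join(f"{name} {col_type}" for name, col_type in columns)
    statements = [
        # New empty table in the Parquet format.
        f"CREATE TABLE {final} ({column_defs}) STORED AS PARQUET",
        # Copy the data, leaving the original table untouched.
        f"INSERT INTO {final} SELECT * FROM {source}",
    ]
    if keep_original_name:
        # Keep the original as a backup, then give the new table the old name.
        statements.append(f"ALTER TABLE {source} RENAME TO {source}_backup")
        statements.append(f"ALTER TABLE {final} RENAME TO {source}")
    return statements


stmts = parquet_migration_statements(
    "source_table",
    "final_table",
    [("id", "INT"), ("name", "STRING")],  # hypothetical columns
    keep_original_name=True,
)
for stmt in stmts:
    print(stmt + ";")
```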
05-13-2020
08:27 AM
@satishjan1 The initial question asks about setting the hostname. The information you reference tells you to do that, but for a different operating system. My first response told you how to do it for RHEL.

For your next question: you do not have to set the hostname in /etc/sysconfig/network. You have to set it the way your operating system requires (see above). The hostname must be set, and it must persist after reboot. If you do not set the hostname before installing the cluster, you will have all sorts of problems with services and components later on down the road.
05-12-2020
07:44 AM
Use the QueryRecord processor with a CSVReader as the record reader and a JSON writer (JsonRecordSetWriter) for the output. Use this query: SELECT satellite_name FROM FLOWFILE. The next processor can then grab the value as an attribute.
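To show what that query does to the data, here is a small sketch that emulates it outside NiFi: read CSV records, keep one column, write JSON. It is not NiFi code, and the sample rows and the `launch_year` column are made up for illustration.

```python
import csv
import io
import json


def select_column(csv_text, column):
    """Emulate SELECT <column> FROM FLOWFILE with a CSV reader and JSON writer."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return json.dumps([{column: row[column]} for row in reader])


# Hypothetical FlowFile content with a header row.
sample = "satellite_name,launch_year\nHubble,1990\nKepler,2009\n"
result = select_column(sample, "satellite_name")
print(result)  # [{"satellite_name": "Hubble"}, {"satellite_name": "Kepler"}]
```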
05-09-2020
09:27 AM
@michaelli You can use the CM API to fetch the service configurations by executing the command below from a terminal:

# curl -u admin_uname:admin_pass "http://cm_server_host:7180/api/v19/cm/deployment"

The password should be present under the name hive_metastore_database_password.

Refer: https://docs.cloudera.com/documentation/enterprise/5-14-x/topics/cm_intro_api.html#concept_dnn_cr5_mr__section_ztf_lyt_xr

Hope this helps,
Paras

Was your question answered? Make sure to mark the answer as the accepted solution. If you find a reply useful, say thanks by clicking on the thumbs up button.
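If you would rather script this than read through the curl output, the sketch below does the same GET with basic auth and then searches the returned JSON for the setting by name. The helper names are hypothetical, and it assumes each config entry in the response is an object with "name" and "value" keys, so verify that against the output from your own CM version.

```python
import base64
import json
import urllib.request


def fetch_deployment(host, user, password, port=7180, api_version="v19"):
    """GET the full deployment JSON from Cloudera Manager using basic auth."""
    url = f"http://{host}:{port}/api/{api_version}/cm/deployment"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request = urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def find_config_values(node, name):
    """Recursively collect the value of every config entry with the given name."""
    found = []
    if isinstance(node, dict):
        if node.get("name") == name and "value" in node:
            found.append(node["value"])
        for child in node.values():
            found.extend(find_config_values(child, name))
    elif isinstance(node, list):
        for child in node:
            found.extend(find_config_values(child, name))
    return found


# Usage against a live server (placeholders as in the curl command):
# deployment = fetch_deployment("cm_server_host", "admin_uname", "admin_pass")
# print(find_config_values(deployment, "hive_metastore_database_password"))
```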
05-04-2020
07:25 AM
@varun_rathinam Accessing JSON in an array object via EvaluateJsonPath can be quite confusing. I also notice the structure of your JSON is somewhat confusing, with the same values in both objects. I have adjusted id2 for cc and dd for testing so that I can tell the id1 and id2 values apart.

The solution you want is (see template for exact string values):

Notice we use the normal tree for each JSON object ( $.object ), then access the array ( 0, 1 ), then access the array's objects. Also notice it is possible to access the JSON object array with or without a . before the [.

Reference: https://community.cloudera.com/t5/Support-Questions/how-to-extract-fields-in-flow-file-which-are-surrounded-by/m-p/208635

You can also find the template I used while testing your issue on my GitHub: https://github.com/steven-dfheinz/NiFi-Templates/blob/master/NiFI_EvaluateJsonPath_Demo.xml

Thanks, Steven @ DFHZ
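To illustrate the path pattern (object, then array index, then the field inside that array element), here is a tiny hand-rolled resolver. It is not the JsonPath implementation NiFi uses, it only handles simple dotted keys and numeric indexes, and the sample document is hypothetical. The exact paths for your data are in the template linked above.

```python
import re


def resolve(doc, path):
    """Resolve a simple path such as $.object[0].id1 or $.object.[0].id1."""
    # Each token is either a numeric index in brackets or a plain key.
    tokens = re.findall(r"\[(\d+)\]|([A-Za-z_]\w*)", path)
    node = doc
    for index, key in tokens:
        node = node[int(index)] if index else node[key]
    return node


# Hypothetical document: an object holding an array of two objects.
doc = {"object": [{"id1": "cc", "id2": "cc2"}, {"id1": "dd", "id2": "dd2"}]}

first_id1 = resolve(doc, "$.object[0].id1")    # without a . before the [
second_id2 = resolve(doc, "$.object.[1].id2")  # with a . before the [
print(first_id1, second_id2)  # cc dd2
```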
04-30-2020
02:07 PM
Thanks for the reply, mate. Much appreciated.