Member since
06-26-2015
509
Posts
136
Kudos Received
114
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1289 | 09-20-2022 03:33 PM | |
3795 | 09-19-2022 04:47 PM | |
2252 | 09-11-2022 05:01 PM | |
2338 | 09-06-2022 02:23 PM | |
3643 | 09-06-2022 04:30 AM |
09-28-2022
12:18 AM
Can you share your settings?
... View more
09-21-2022
03:24 PM
Is your dev cluster running the exact same version of NiFi as production, including the NiFi lib folder?
... View more
09-20-2022
03:33 PM
1 Kudo
@progowl , Yes, it is. Check out the docker compose configuration in this article: https://community.cloudera.com/t5/Community-Articles/NiFi-cluster-sandbox-on-Docker/ta-p/346271 Cheers, André
... View more
09-19-2022
04:47 PM
@SAMSAL @ChuckE , I believe parsing the schema for each flowfile that goes through the processor would be too expensive. Because of that, the schema is parsed only once when the processor is scheduled and used for every flowfile. That's why the attribute values cannot be used for this property. Having a schema hashmap<filename, parsed_schema> internally could be an interesting idea so that the processor would parse the schema onTrigger only once for every schema file name and reuse it afterwards. Obviously memory usage could be a problem if you have too many schemas, but I don't think this is likely to happen. This doesn't happen currently, but it would be a nice feature request IMO. Currently, you can either do that with a scripting processing or use RouteOnAttribute to send each message to a ValidateXML processor with the correct schema. Cheers, André
... View more
09-11-2022
05:01 PM
1 Kudo
@sekhar1 , The CDP user that you're using to execute your job needs an "IDBroker mapping" to a valid AWS role to be able to access the contents of the S3 bucket. Please check this: https://docs.cloudera.com/cdf-datahub/7.2.10/nifi-hive-ingest/topics/cdf-datahub-hive-ingest-idbroker-mapping.html Cheers, André
... View more
09-07-2022
05:04 AM
1 Kudo
Everything you do in the NiFi UI can also be done using the NiFi REST API. So if you want/need to automate it, it's totally possible and not difficult. Cheers, André
... View more
09-07-2022
03:57 AM
@SandyClouds , Have a look at Parameters and Parameters Contexts: https://nifi.apache.org/docs/nifi-docs/html/user-guide.html#Parameters Cheers André
... View more
09-06-2022
03:24 PM
@rafy , Same approach. Just create the processor(s) and let the files flow through it 🙂 Cheers, André
... View more
09-06-2022
02:23 PM
@rafy You shouldn't need to extract the content as an attribute. Instead, use ReplaceText to replace the contents of the flowfile with the SQL template, like this: INSERT INTO my_table (xml_col) VALUES ('$1') Where $1 is a reference to the default regex capture group that captures the entire content. In reality, it requires a bit more labour, since you have to escape the single quotes in the XML first. Cheers, André
... View more
09-06-2022
02:17 PM
@VenkatG , What are you trying to use this resulting JSON for? That seems a bit odd of a format to me. Nevertheless, here's a JOLT to achieve that (or close): [
{
"operation": "default",
"spec": {
"__timestamp": "${now()}"
}
},
{
"operation": "shift",
"spec": {
"custId": "Rows[0].values[0]",
"name": "Rows[0].values[1]",
"Address2": "Rows[0].values[2]",
"Address1": "Rows[0].values[3]",
"zip": "Rows[0].values[4]",
"__timestamp": "Rows[0].values[5]"
}
},
{
"operation": "default",
"spec": {
"operationType": "Insert",
"Source": "Dev"
}
}
] Cheers, André
... View more