1973
Posts
1225
Kudos Received
124
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2486 | 04-03-2024 06:39 AM |
| | 3840 | 01-12-2024 08:19 AM |
| | 2079 | 12-07-2023 01:49 PM |
| | 3064 | 08-02-2023 07:30 AM |
| | 4195 | 03-29-2023 01:22 PM |
08-11-2020
01:37 PM
Could be local security, file permissions, or iptables / a firewall between the NiFi cluster and HDFS. https://www.edureka.co/community/30977/hadoop-hdfs-exception-in-createblockoutputstream
Could crypto libraries be installed? https://community.cloudera.com/t5/Support-Questions/NiFi-PutHDFS-Writing-Zero-Bytes-appears-to-be-crypto-related/td-p/115493
Is this NiFi installed via CDF/CFM Cloudera Manager? If so, please open a support ticket.
08-11-2020
01:33 PM
Which Kerberos ID are you logged in with on your PC versus on the NiFi cluster? Perhaps the NiFi Kerberos ID does not have WRITE permissions on that directory, or you are not logged in to Kerberos on the NiFi cluster. People usually don't have the same Kerberos credentials on their PC as on their cluster. It should be a service user, not a personal user.
08-11-2020
11:36 AM
There are a number of possible causes for this:
- The NameNode may be overloaded. Check the logs for messages that say "discarding calls...".
- There may not be enough (or any) DataNodes running for the data to be written. Again, check the logs.
- Every DataNode on which the blocks were stored might be down (or not connected to the NameNode; it is impossible to distinguish the two).
This looks like an HDFS issue. Can you post from Hue or the HDFS command line?
https://cwiki.apache.org/confluence/display/HADOOP2/TroubleShooting
08-11-2020
08:41 AM
Can you post configuration information, more logs, plus some details on:
- NiFi version
- HDFS version
- Encrypted file system?
- Cloud?
- OS version
- JDK version / JVM version
- CDH/HDP/CDP version
- PutHDFS settings
- What kind of data?
- Example data
08-06-2020
06:11 AM
That's good news. That's a tricky one. Glad things are working for you.
08-05-2020
10:27 AM
Maybe set it via a file:
https://community.cloudera.com/t5/Community-Articles/Exploring-Apache-NiFi-1-10-Parameters-and-Stateless-Engine/ta-p/283266
See also the REST API docs:
https://nifi.apache.org/docs/nifi-docs/rest-api/
https://nifi.apache.org/docs/nifi-registry-docs/rest-api/rest-api.html#_versionedparameter
/flow/parameter-contexts gets all Parameter Contexts.
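A minimal Python sketch of calling that endpoint, in case it helps. The base URL, the optional bearer token, and the helper names are all assumptions for illustration, and the response layout (a `parameterContexts` list whose entries carry a `component.name`) is what I would expect from the REST API docs, so verify it against your NiFi version.

```python
import json
from urllib.request import Request, urlopen


def parameter_contexts_url(base_url):
    """Build the URL for listing all Parameter Contexts.

    base_url is a hypothetical NiFi address such as http://localhost:8080.
    """
    return base_url.rstrip("/") + "/nifi-api/flow/parameter-contexts"


def context_names(payload):
    """Pull the context names out of a decoded JSON response.

    Assumes the response has a top-level "parameterContexts" list and
    that each entry has a "component" object with a "name" field.
    """
    return [
        entry.get("component", {}).get("name")
        for entry in payload.get("parameterContexts", [])
    ]


def fetch_parameter_contexts(base_url, token=None):
    """GET the Parameter Contexts from a running NiFi (needs network access).

    token is an optional bearer token for secured clusters (assumption:
    your cluster accepts token auth; unsecured dev installs need none).
    """
    request = Request(
        parameter_contexts_url(base_url),
        headers={"Accept": "application/json"},
    )
    if token:
        request.add_header("Authorization", "Bearer " + token)
    with urlopen(request) as response:
        return json.load(response)
```

Against a running instance you would call something like `context_names(fetch_parameter_contexts("http://localhost:8080"))`.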
07-29-2020
08:01 AM
This is a great decision chart. I would add Flink SQL for querying events in a stream and for querying Kafka topics. If you have time series data, definitely use Druid. If your data is not time series or timestamp driven, do not use Druid; use Kudu instead.
https://druid.apache.org/faq
https://druid.apache.org/docs/latest/comparisons/druid-vs-kudu.html
https://druid.apache.org/docs/latest/comparisons/druid-vs-key-value.html
07-21-2020
09:19 AM
1 Kudo
The easiest way to grab monitoring data is via the NiFi REST API. Everything in the NiFi UI is done through REST calls, which you can also make programmatically. Please read the NiFi docs; they are linked directly from your running NiFi application and are also on the web. They are very thorough and have all the information you could want: https://nifi.apache.org/docs/nifi-docs/. If you are not running NiFi 1.11.4, I recommend that you upgrade. This is supported by Cloudera on multiple platforms.

NiFi REST API: https://nifi.apache.org/docs/nifi-docs/rest-api/
There's also an awesome Python wrapper for that REST API: https://pypi.org/project/nipyapi/

Also, in NiFi flow programming, every time you produce data to Kafka you get metadata back in FlowFile attributes. You can push those attributes directly to a Kafka topic if you want. After your PublishKafkaRecord_2_0 (1.11.4) processor, on the success relationship, read the attributes for the number of records and other data, convert them with AttributesToJSON, and push the result to another topic. You may want a MergeRecord in there to aggregate a few of those together.

If you are interested in Kafka metrics, record counts, or monitoring, use Cloudera Streams Messaging Manager. It provides a full web UI, monitoring tool, alerts, and REST API, and everything you need for monitoring every producer, consumer, broker, cluster, topic, message, offset, and Kafka component.

The best way to get NiFi stats is to use the NiFi Reporting Tasks. I like the SQL Reporting Task. SQL Reporting Tasks are very powerful and use standard SELECT * FROM JVM_METRICS style reporting; see my article: https://www.datainmotion.dev/2020/04/sql-reporting-task-for-cloudera-flow.html

Monitoring Articles
https://www.datainmotion.dev/2019/04/monitoring-number-of-of-flow-files.html
https://www.datainmotion.dev/2019/03/apache-nifi-operations-and-monitoring.html

Other Resources
https://www.datainmotion.dev/2019/10/migrating-apache-flume-flows-to-apache_9.html
https://www.datainmotion.dev/2019/08/using-cloudera-streams-messaging.html
https://dev.to/tspannhw/apache-nifi-and-nifi-registry-administration-3c92
https://dev.to/tspannhw/using-nifi-cli-to-restore-nifi-flows-from-backups-18p9
https://nifi.apache.org/docs/nifi-docs/html/toolkit-guide.html
https://www.datainmotion.dev/p/links.html
https://www.tutorialspoint.com/apache_nifi/apache_nifi_monitoring.htm
https://community.cloudera.com/t5/Community-Articles/Building-a-Custom-Apache-NiFi-Operations-Dashboard-Part-1/ta-p/249060
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-metrics-reporting-nar/1.11.4/org.apache.nifi.metrics.reporting.task.MetricsReportingTask/
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-scripting-nar/1.11.4/org.apache.nifi.reporting.script.ScriptedReportingTask/index.html
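To make the REST approach above a bit more concrete, here is a rough Python sketch that polls the flow status endpoint and keeps a few queue and thread numbers. The base URL and helper names are hypothetical, and the field names (`controllerStatus`, `activeThreadCount`, `flowFilesQueued`, `bytesQueued`) are what I would expect from the `/flow/status` response in the REST API docs; please check them against your NiFi version before relying on this.

```python
import json
from urllib.request import Request, urlopen


def flow_status_url(base_url):
    """Build the flow status URL; base_url is a hypothetical NiFi address."""
    return base_url.rstrip("/") + "/nifi-api/flow/status"


def summarize_flow_status(payload):
    """Reduce a decoded /flow/status response to a few monitoring numbers.

    Assumes the response wraps its values in a "controllerStatus" object.
    Missing fields fall back to 0 so that a partial response does not fail.
    """
    status = payload.get("controllerStatus", {})
    return {
        "active_threads": status.get("activeThreadCount", 0),
        "flowfiles_queued": status.get("flowFilesQueued", 0),
        "bytes_queued": status.get("bytesQueued", 0),
    }


def fetch_flow_status(base_url):
    """GET the current flow status from a running NiFi (needs network access).

    Secured clusters will also need authentication, which is left out here.
    """
    request = Request(flow_status_url(base_url), headers={"Accept": "application/json"})
    with urlopen(request) as response:
        return json.load(response)
```

You could run `summarize_flow_status(fetch_flow_status("http://localhost:8080"))` on a schedule and ship the resulting dictionary wherever your monitoring lives. For anything beyond a quick check, the Reporting Tasks or nipyapi are probably the better route.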
05-12-2020
11:47 AM
You should not use Flume. Flume and its connectors are deprecated. This flow, like any other, can easily move to NiFi. https://dev.to/tspannhw/migrating-apache-flume-flows-to-apache-nifi-jms-to-x-and-x-to-jms-1g02
05-08-2020
12:03 PM
Awesome. Good luck with NiFi.