Member since
05-30-2018
1322
Posts
715
Kudos Received
148
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 4067 | 08-20-2018 08:26 PM | |
| 1962 | 08-15-2018 01:59 PM | |
| 2390 | 08-13-2018 02:20 PM | |
| 4138 | 07-23-2018 04:37 PM | |
| 5045 | 07-19-2018 12:52 PM |
03-13-2017
07:54 PM
2 Kudos
I am using NiFi for my data flow and then I kick off ETL script which runs many (hive/Pig) MR/Tez jobs. Is there easy way to detect (ie trigger) once the job has finished. Creating a trigger manually per job is not scalable since this are many jobs. Going into each job and have it create a trigger is off the table.
... View more
Labels:
03-13-2017
04:14 AM
are you still getting the error?
... View more
03-13-2017
01:54 AM
HDP service logs are available in ambari log search. the back end is solr so you can pull all or only relevant info based on your requirements. Also for service level metrics, ambari stores these now in grafana.
... View more
03-13-2017
01:40 AM
if you are looking for simple storage and analytics on logs then HDFS if you are looking for low latency reads/writes on log events then phoenix/hbase for cyber security, metron+nifi+hdfs For searching on logs, solr For low latency reads/writes and searching, HBase+solr (using lily indexer)
... View more
03-13-2017
01:31 AM
1 Kudo
Have you confirmed that error messages exist on the bulletin? Also... NiFi logs (errors) are stored in ambari solr infra instance. You can grab all errors/warnings from ambari-infra and push to postgres.
... View more
03-13-2017
01:23 AM
1 Kudo
Please set compression codec to NONE in the putHDFS process.
... View more
03-13-2017
01:18 AM
1 Kudo
Documentation for PutHiveStreaming requires flow file to be in avro format. My understanding is hive streaming only support ORC format. When puthivestreaming is used, does it convert avro to orc prior to inserting into hive table? trying to understand the functionality.
... View more
Labels:
- Labels:
-
Apache NiFi
03-13-2017
01:14 AM
2 Kudos
The documentation for ListFile states: If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data. How does the "new" primary node pick up where the previous node left off without flow file duplication? I ask since the previous primary node may have the file flow, when new primary node is elected how does it primary node get the flow file without duplicating or cloning it?
... View more
Labels:
- Labels:
-
Apache NiFi
03-10-2017
04:24 PM
I have cluster which is NOT kerberized. Is it possible to enable user impersonation for hive queries run from zeppelin? All the documentation seems to require user princple.
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Zeppelin