Member since
01-11-2016
355
Posts
230
Kudos Received
74
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8190 | 06-19-2018 08:52 AM | |
3146 | 06-13-2018 07:54 AM | |
3574 | 06-02-2018 06:27 PM | |
3876 | 05-01-2018 12:28 PM | |
5397 | 04-24-2018 11:38 AM |
03-22-2018
01:47 PM
Hi @rajdip chaudhuri Have you considered NiFi? you have out of the box processors to list/fetch files and to write to HDFS. You can also use a NiFi cluster if you want to distribute the load on several nodes.
... View more
03-20-2018
10:01 AM
1 Kudo
Hi @Jayendra Patil Setting the optimal value of max thread count depends on your use cases and what processors you are using (CPU intensive like convert processor or IO intensive like the put/get processors). I've seen better usage of my hardware by having thread count around 2x number of cores. I've seen some cluster with 3x number of cores. I think you can go beyond 50 in your case and monitor the behavior. The best thing to do is to proceed in an incremental manner. I hope this helps. Abdelkrim
... View more
03-20-2018
09:33 AM
1 Kudo
Hi @dhieru singh AmbariReportingTask can be used to send metric to AMS. You can see the GC metrics that it can send to AMS : https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-ambari-nar/1.5.0/org.apache.nifi.reporting.ambari.AmbariReportingTask/additionalDetails.html In the default Grafana dashboard, the information is not used. But you can create a dashboard to show jvm.gc.runs.G1 Young Generation for example. Below a simple dashboard that show this information:
... View more
03-17-2018
06:45 PM
Hi @Karl Fredrickson If you have Knox you can use it to encapsulate Kerberos authentication and use username/password. Thanks
... View more
03-17-2018
06:33 PM
1 Kudo
Hi What error are you facing? If it's HTTP authorization error, then check Ranger audit to understand what's happening. Make sure that you have the right Ranger policies for UI and API.
... View more
03-17-2018
06:30 PM
Hi @David Manukian This seems to be a virtualbox network configuration issue. Check your port forwarding and add a rule for 8080.
... View more
03-07-2018
06:24 PM
Hi @Eric Lloyd I am not sure I understand your use case. NiFi tails a local file. From your question, it looks like you are trying to tail the same fail when master switch. Is your file visible to both nodes (such as NAS storage) ? TailFile saves it's state to avoid duplicating data from one file. There's two option to store the state : local and remote. Have you set "state location" to remote ? As per the doc : Specifies where the state is located either local or cluster so that state can be stored appropriately in order to ensure that all data is consumed without duplicating data upon restart of NiFi
... View more
02-22-2018
05:09 PM
1 Kudo
Hi @spdvnz Your input and output CSV schema for the LookupRecord should be different. In the output schema you should add a field 'Company' that the processor will populate. Take a look at this example where I added the field city in the output schema : https://medium.com/@abdelkrim.hadjidj/data-flow-enrichment-with-apache-nifi-d221f1dde419
... View more
02-07-2018
05:50 PM
1 Kudo
Hi @Cesar Rodrigues The cleaniest way should be to use ConvertRecord processor with a CSVReader (using Delimiter as pipe) and JSonSetRecordWriter. This directly convert your CSV into JSON without passing by attributes. Using Record processors also gives you better performance. Thanks
... View more
02-03-2018
07:17 AM
Thanks @David Doran If you found that this answer addressed your question,
please take a moment to click "Accept" below.
... View more