About ahadjidj

ahadjidj · ‎03-22-2018

Hi @rajdip chaudhuri Have you considered NiFi? you have out of the box processors to list/fetch files and to write to HDFS. You can also use a NiFi cluster if you want to distribute the load on several nodes.

ahadjidj · ‎03-20-2018

Hi @Jayendra Patil Setting the optimal value of max thread count depends on your use cases and what processors you are using (CPU intensive like convert processor or IO intensive like the put/get processors). I've seen better usage of my hardware by having thread count around 2x number of cores. I've seen some cluster with 3x number of cores. I think you can go beyond 50 in your case and monitor the behavior. The best thing to do is to proceed in an incremental manner. I hope this helps. Abdelkrim

ahadjidj · ‎03-20-2018

Hi @dhieru singh AmbariReportingTask can be used to send metric to AMS. You can see the GC metrics that it can send to AMS : https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-ambari-nar/1.5.0/org.apache.nifi.reporting.ambari.AmbariReportingTask/additionalDetails.html In the default Grafana dashboard, the information is not used. But you can create a dashboard to show jvm.gc.runs.G1 Young Generation for example. Below a simple dashboard that show this information:

ahadjidj · ‎03-17-2018

Hi @Karl Fredrickson If you have Knox you can use it to encapsulate Kerberos authentication and use username/password. Thanks

ahadjidj · ‎03-17-2018

Hi What error are you facing? If it's HTTP authorization error, then check Ranger audit to understand what's happening. Make sure that you have the right Ranger policies for UI and API.

ahadjidj · ‎03-17-2018

Hi @David Manukian This seems to be a virtualbox network configuration issue. Check your port forwarding and add a rule for 8080.

ahadjidj · ‎03-07-2018

Hi @Eric Lloyd I am not sure I understand your use case. NiFi tails a local file. From your question, it looks like you are trying to tail the same fail when master switch. Is your file visible to both nodes (such as NAS storage) ? TailFile saves it's state to avoid duplicating data from one file. There's two option to store the state : local and remote. Have you set "state location" to remote ? As per the doc : Specifies where the state is located either local or cluster so that state can be stored appropriately in order to ensure that all data is consumed without duplicating data upon restart of NiFi

ahadjidj · ‎02-22-2018

Hi @spdvnz Your input and output CSV schema for the LookupRecord should be different. In the output schema you should add a field 'Company' that the processor will populate. Take a look at this example where I added the field city in the output schema : https://medium.com/@abdelkrim.hadjidj/data-flow-enrichment-with-apache-nifi-d221f1dde419

ahadjidj · ‎02-07-2018

Hi @Cesar Rodrigues The cleaniest way should be to use ConvertRecord processor with a CSVReader (using Delimiter as pipe) and JSonSetRecordWriter. This directly convert your CSV into JSON without passing by attributes. Using Record processors also gives you better performance. Thanks

ahadjidj · ‎02-03-2018

Thanks @David Doran If you found that this answer addressed your question, please take a moment to click "Accept" below.

Online	Offline
Last Visited	‎08-19-2019 05:07 AM

Member Since	‎01-11-2016 06:11 PM
Last Visited	‎08-19-2019 05:07 AM
Posts	355
Kudos received	232

Cloudera Community

Re: How to access NIFI Process Group variable in E...

Re: GETSFTP with NiFi cluster

Re: how is Kafka different from Mosquitto(MQTT) ?

Re: Whitelisting using LookupAttribute

Re: Is there any ways if we can schedule or trigge...

Re: Copy large number of massive files from local ...

Re: How to configure NiFi to maximize the usage of...

Re: NiFi monitor garbage collection from Grafana

Re: Hive JDBC driver with keytab authentication

Re: Issues with Knox and Ambari

Re: There is no way to get ambari UI after install...

Re: Avoiding Duplicate data with Nifi TileFile pro...

Re: LookUpRecord and SimpleCsvFileLookupService in...

Re: [Nifi] Converting a delimited FlowFile's conte...

Re: Parameterized NiFi template handling