Member since: 07-19-2018
Posts: 613
Kudos Received: 101
Solutions: 117
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 4901 | 01-11-2021 05:54 AM
 | 3337 | 01-11-2021 05:52 AM
 | 8643 | 01-08-2021 05:23 AM
 | 8158 | 01-04-2021 04:08 AM
 | 36037 | 12-18-2020 05:42 AM
02-29-2020
06:58 AM
1 Kudo
Assuming you used the same certs, or even if you used different ones, you should be able to click the lock icon in any SSL-based UI (Ambari, Ranger, NiFi, YARN, Grafana, etc.) in your browser. This will show you all of the cert details, including expiration dates.
02-24-2020
11:20 PM
1 Kudo
@stevenmatison For the sake of completeness, and to confirm your suspicions: with NiFi 1.11.1, parameters also work in the PutEmail processor.
02-19-2020
12:04 AM
1 Kudo
OK, so my mate and I figured it out :). Finally! It's like this: $.outputs[?(@.name=="EM_CLASSIFICATION")].value
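To show what that JSONPath filter selects, here is a plain-Python equivalent. The sample document shape (an "outputs" array of name/value objects) is an assumption based on the expression in the post, not taken from the original data.

```python
# Hypothetical sample document matching the shape the JSONPath expects
doc = {
    "outputs": [
        {"name": "EM_CLASSIFICATION", "value": "spam"},
        {"name": "OTHER_OUTPUT", "value": "ignored"},
    ]
}

# Equivalent of $.outputs[?(@.name=="EM_CLASSIFICATION")].value:
# filter the array on the "name" field, then project the "value" field
values = [o["value"] for o in doc["outputs"] if o.get("name") == "EM_CLASSIFICATION"]
print(values)  # -> ['spam']
```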
02-18-2020
08:39 PM
Well, I'm trying to put together a fairly generic service to ingest files. One of the reqs is to be able to strip off any header or trailer lines. Files can come from any number of sources, so I don't think there will be any kind of pattern for a regex.
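Since there is no regex-able pattern across sources, positional stripping is the usual fallback. A minimal sketch (the function name and per-source counts are my own, not from the post):

```python
def strip_header_trailer(lines, header_count=1, trailer_count=1):
    """Drop a fixed number of leading and trailing lines.

    header_count / trailer_count would come from per-source config,
    since the files have no common pattern to match with a regex.
    """
    end = len(lines) - trailer_count if trailer_count else len(lines)
    return lines[header_count:end]

# Hypothetical file with one header and one trailer record
records = ["HDR|20200218", "row1", "row2", "TRL|2"]
print(strip_header_trailer(records))  # -> ['row1', 'row2']
```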
02-13-2020
04:19 PM
Hi @vikrant_kumar24,
Looking over your most recent post, it appears that @stevenmatison solved your issue. Once you've had a chance to try out the files he provided, can you confirm by using the Accept as Solution button at the bottom of his reply, so it can be of assistance to others?
02-12-2020
05:43 AM
Thanks @stevenmatison. I am using Parquet format; I tried ORC and saw no significant difference. I then changed the following settings (I don't know a lot about them, but this is based on my research; I am not using partitions yet):
set hive.cbo.enable=true;
set hive.compute.query.using.stats=true;
set hive.stats.fetch.column.stats=true;
set hive.stats.fetch.partition.stats=true;
set hive.vectorized.execution.enabled=true;
I also changed the execution engine:
set hive.execution.engine=spark;
I think changing the engine to Spark made a lot of the difference: the query went from 2.48 min to 15 sec. I am quite satisfied with the current performance, but I would sure appreciate other advice, for me and for the community. Thanks, and I appreciate your response. Andy
02-12-2020
05:02 AM
1 Kudo
@Peruvian81 I would delete only enough to get NiFi restarted. Then I would go into the flow and look at what caused it to fill up. This of course assumes you had enough space on the drive to begin with. Next, I recommend following NiFi's documented steps for disk configuration and, based on your flow, expanding the content repository if necessary and if possible. One last thing to consider: your flow may just need to terminate large flowfiles once they are finished at the end of the line. If they are held in a queue and no longer needed, they are taking up valuable space.
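For reference, these are the nifi.properties entries usually involved when the content repository fills a disk; the path and values below are examples, not settings from the original post:

```properties
# Where the content repository lives; point it at a larger volume if needed
nifi.content.repository.directory.default=./content_repository

# Archiving keeps content around for provenance replay; tightening the
# retention period and usage cap is a common way to stop the repo
# from filling the disk
nifi.content.repository.archive.enabled=true
nifi.content.repository.archive.max.retention.period=12 hours
nifi.content.repository.archive.max.usage.percentage=50%
```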
02-12-2020
04:57 AM
@AarifAkhter
You need to create permissions within MySQL for your Ranger user.
An example of this is:
CREATE DATABASE ranger;
CREATE USER 'ranger'@'hdp.cloudera.com' IDENTIFIED BY 'ranger';
GRANT ALL PRIVILEGES ON *.* TO 'ranger'@'hdp.cloudera.com' WITH GRANT OPTION;
FLUSH PRIVILEGES;
where you replace hdp.cloudera.com above with your Ranger host's name, for example:
ip-xxx-xx-xx-xx.ec2.internal
I would also recommend installing using FQDNs (Fully Qualified Domain Names).
Please accept this answer as the solution to close the topic.
02-04-2020
12:28 PM
@wengelbrecht thank you, that is exactly what I needed to see. I am having an issue with parquet-hadoop-1.10 and need to get a 1.12 version working in NiFi and Hive....
02-04-2020
09:18 AM
Hi all, the solution above fails in one scenario. Scenario: if multiple flowfiles are processed at the same time and land in the NiFi queue that follows the update query (i.e. the PutHiveQL processor that increments processed_file_cnt by one for every flowfile), then there is a chance of triggering the next flow multiple times, which is wrong. This is because we first SELECT processed_file_cnt and only then compare processed_file_cnt with input_file_cnt, as a separate step.
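The race comes from the separate select-then-compare. A minimal sketch of the fix, with the Hive table replaced by an in-memory counter for illustration (class and method names are mine; only the column names follow the post): make the increment and the comparison one atomic step, so each flowfile compares the count it just produced rather than a value read later.

```python
import threading

class FileCounter:
    """Sketch: atomic increment-and-compare, standing in for the Hive table."""

    def __init__(self, input_file_cnt):
        self.input_file_cnt = input_file_cnt
        self.processed_file_cnt = 0
        self._lock = threading.Lock()

    def on_flowfile(self):
        with self._lock:
            self.processed_file_cnt += 1
            # Compare the value we just produced, inside the same critical
            # section -- the separate SELECT after the UPDATE is the race
            # in the original flow, since two flowfiles can both observe
            # the final count and both trigger the next flow.
            return self.processed_file_cnt == self.input_file_cnt

counter = FileCounter(3)
triggers = [counter.on_flowfile() for _ in range(3)]
print(triggers)  # -> [False, False, True]: only the last flowfile triggers
```

The same idea in SQL terms: have the increment return the post-update count (or guard the whole increment-plus-compare with a lock), instead of issuing an independent SELECT afterwards.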