Member since
11-16-2015
893
Posts
652
Kudos Received
245
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
6670 | 02-22-2024 12:38 PM | |
1510 | 02-02-2023 07:07 AM | |
3309 | 12-07-2021 09:19 AM | |
4384 | 03-20-2020 12:34 PM | |
14830 | 01-27-2020 07:57 AM |
11-09-2016
02:12 PM
Thanks @Andrew Grande! That worked! I feel like a noob 🙂 but appreciate all the help!
... View more
11-08-2016
08:11 PM
1 Kudo
Avro doesn't like the dots in the attribute names (which become field names), perhaps you could rename them (with UpdateAttribute or at the source processor(s)) using underscores (or other valid characters like alphanumeric characters, see the link for the rules).
... View more
03-16-2018
03:15 PM
now use record processor
... View more
10-27-2016
09:44 PM
@mclark @Matt Burgess so in any case does the file needs to be read in to memory before it splits.?? either by lines or by bytes. i was hoping it starts the next process work once it receive first split.? in my case it waited 8 minutes until it split the 10GB file into 1200+ splits. If my files are about 100 GB each (I have 18 such files) I am scared to run the whole flow for all files. I may have to run for each file one by one.?
... View more
12-14-2018
03:53 PM
I am getting same error in HDP 3.
... View more
10-04-2016
01:08 PM
1 Kudo
Great answer! Just to add a caveat, if you are using HDF 2.0 and HDP 2.5, please see the following: https://community.hortonworks.com/questions/59681/puthivestreaming-nifi-processor-various-errors.html
... View more
10-04-2016
04:09 PM
@Matt Burgess 2 things Resolved the issue: 1. start with the "jdbc:hive2" prefix jdbc:hive2://host.name.net:10000/;principal=hive/_HOST@EXAMPLE.COM 2. Add following property to hive-site.xml that is passed under HiveConnectionPool "Hive Configuration Resources" property. <property>
<name>hadoop.security.authentication</name>
<value>kerberos</value>
</property>
... View more
08-17-2017
10:54 AM
How do you not use Zookeeper for service discovery. Please give me step by step instructions if possible. Tried all recommended methods online with no luck. Thank you.
... View more
09-09-2016
07:21 PM
The PutElasticsearch processor uses the Transport API for Elasticsearch, not the HTTP API. This means your port should be 9300 not 9200. The "Identifier Attribute" property is the name of a flow file attribute that contains a unique identifer for the document. If you don't have an identifier you want to use, you can put "uuid", this will use the flow file's UUID as the identifier for the Elasticsearch document. If you do have an identifier for the document, put its value (using UpdateAttribute, EvaluateJsonPath, etc.) into a flow file attribute, and put that attribute's name in the "Identifier Attribute" property. Note you don't use Expression language here, so if your attribute's name is "doc_id", you put "doc_id" in the Identifier Attribute property, not "${doc_id}"
... View more
12-22-2016
01:35 PM
@David Kjerrumgaard I found out the issue was with my csv file. It had to do with the column names i was ingesting in the csv file. Also, the turning the Obtain Generated Keys property to false worked as well in my case.
... View more