Member since
02-11-2022
21
Posts
1
Kudos Received
3
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2575 | 01-03-2023 01:57 PM | |
1311 | 01-03-2023 01:56 PM | |
4863 | 02-14-2022 10:08 AM |
02-14-2022
10:14 AM
HI I want to configure ElasticSearchLookupService controler service in order to get some info in my index. Is there something alse I should to add ? Thanks in advance.
... View more
Labels:
- Labels:
-
Apache NiFi
02-14-2022
10:08 AM
I used GetHDFSFileInfo to get the numbe of incoming files with hdfs.count.files attribute Then at the end of the dataflow I move the processed files into a separate folder so only files to merge stay in the root folder. Thanks to @OliverGong for the hint 🙂
... View more
02-14-2022
05:49 AM
Hi @OliverGong Thanks a lot for your helpful answer. It increments the fragment.index atrribute until the BatchSize parameter value. It works when I kow how much files I want to merge so I set the value in the BatchSize variable But when I don't know how many files to merge (from business users) the fragment.count is not set correclty. Is there a way to get dynamically the number of incoming files?
... View more
02-14-2022
02:45 AM
I think about Groovy script but did not find how to loop each flowfile or how to get the count of the files
... View more
02-14-2022
02:43 AM
Hi @araujo Thanks a lot for you implication. That helped me to assign the metric attribute. But not for fragment.index attribute because I might have more than one file coming from the same directory so I should assign different fragment.index for each one. I also need to count the number of incoming files in order to assign the fragment.count attribute.
... View more
02-14-2022
01:04 AM
Hi Is there a way to get the number of the files in the input then assign the count value to an attribute and assign a number value to each file I have files to merge using MergeContent so I should assign fragment.index for each file and fragment.count as total count of files to merge.
... View more
Labels:
- Labels:
-
Apache NiFi
02-13-2022
10:50 AM
Hi @mburgess Is there a way to get the number of the files in the input then assign the count value to an attribute and assign a number value to each file I have files to merge using MergeContent so I should assign fragment.index for each file and fragment.count as total count of files to merge.
... View more
02-12-2022
06:36 PM
Hi @araujo thanks for your reply This an example: I have six csv files:. file1.csv, file2.csv, file3.csv, file4.csv have the same structure file5.csv, file6.csv have a different structure but the have some common columns that I will use in the QueryRecord In order to use The MergeContent, I should give a different.index attribute to each filename, it should be between 0 and 5 (as I have 6 files ). Before the MergeContent, I use ListHDFS >> FechHDFS >> UpdateAttribute 6 times (for each file) which is not a good design as I can have more than 6 files in the future, UpdateAttribute is where I assign the frangment.index attribute for each file. My question is, is there a way to have ONE ListHDFS >> FechHDFS >> UpdateAttribute that get all files and assign a different frangment.index for each file (between 0 and 5) in one UpdateAttribute processor For your question about the QueryRecord: I give a "metric"parameter for the 4 first files and another to two others in UpdateAttribute processor then in the QueryRecord I use this kind of query: select file1.col1, file1.col2, file2.col3, file2.col4,file3.col5,file3.col6
from (
select ID, file1.col1, file1.col2 where m = 'a'
) file1
left join (
select ID, file2.col3, file2.col4 from FLOWFILE where m = 'b'
) file2 on file1.ID_ART = file2.ID_ART
left join (
select ID, file3.col5,file3.col6 from FLOWFILE where m = 'c'
) file3 on file1.ID = file3.ID
... View more
- « Previous
-
- 1
- 2
- Next »