Hi,
We are processing a ZIP file that contains multiple timestamped files (.hql, .csv) in a distributed manner.
We check whether each file's extension is .hql or .csv and route it to the PutHiveQL or PutHDFS processor, respectively.
The ZIP file contains the files below, which must be extracted and processed in timestamp order (the timestamp part of the name is, for example, t1 or a system timestamp):
table_info.zip
table_info_t1.hql
table_info_t1_1.csv
table_info_t1_2.csv
table_info_t2.hql
table_info_t2_1.csv
table_info_t2_2.csv
table_info_latest.hql
table_info_latest.csv
Please find our NiFi flow and RouteOnAttribute property below.
Is there any way to make PutHDFS wait until PutHiveQL has finished, and then signal PutHDFS to run, for each timestamp's files, one group at a time and in order?
Can we group the files by timestamp, run each group's .hql file first, and then put that group's .csv files into HDFS?
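To make the ordering we are after concrete, here is a rough Python sketch of the grouping logic. It is not NiFi configuration, only an illustration of the intended result. The file name pattern (`<table>_<timestamp>[_<part>].<ext>`, with the timestamp being `tN` or `latest`) and the helper names `group_key` and `plan` are assumptions based on the example names above.

```python
import re
from collections import defaultdict

# Assumed name pattern: <table>_<timestamp>[_<part>].<ext>
# where <timestamp> is "t<number>" or "latest" (taken from the example list).
NAME_RE = re.compile(
    r"^(?P<table>.+?)_(?P<ts>t\d+|latest)(?:_(?P<part>\d+))?\.(?P<ext>hql|csv)$"
)


def group_key(ts):
    """Sort t1, t2, ... numerically and put 'latest' at the end."""
    return (1, 0) if ts == "latest" else (0, int(ts[1:]))


def plan(names):
    """Return the names in processing order: per timestamp, .hql before .csv."""
    groups = defaultdict(lambda: {"hql": [], "csv": []})
    for name in names:
        m = NAME_RE.match(name)
        if m is None:
            continue  # ignore names that do not follow the assumed pattern
        groups[m["ts"]][m["ext"]].append(name)

    ordered = []
    for ts in sorted(groups, key=group_key):
        ordered.extend(sorted(groups[ts]["hql"]))  # run the DDL/DML first
        ordered.extend(sorted(groups[ts]["csv"]))  # then load the data files
    return ordered


if __name__ == "__main__":
    files = [
        "table_info_latest.csv", "table_info_t2_2.csv", "table_info_t1.hql",
        "table_info_t2.hql", "table_info_t1_2.csv", "table_info_latest.hql",
        "table_info_t1_1.csv", "table_info_t2_1.csv",
    ]
    for f in plan(files):
        print(f)
```

In other words, each timestamp forms one group, the groups run in timestamp order, and inside a group the .hql file must complete before any of its .csv files are written to HDFS.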
@Nifi