Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Remove Duplicate Record values without cache services

avatar
Explorer

Hi All, Good Morning!

 

Please help me in removing Duplicate Records without using cache services (Apache NIFI)Error LogError Log

1 ACCEPTED SOLUTION

avatar
Explorer
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login
11 REPLIES 11

avatar

Not sure if this is the best way but try using QueryRecord and create one dynamic property to funnel the data into where your query will look like this:

Select distinct * from FLOWFILE

avatar
Explorer

Thanks for the valuable information and I got the results , but the Date and Error Message are not sorted/ordered properly.

Do We have any particular Function on Sql Query or any processor to sort in Order for the same

avatar

have you tried Selected distinct * order by Date, ERROR_MSG

avatar
Explorer
hide-solution

This problem has been solved!

Want to get a detailed solution you have to login/registered on the community

Register/Login

avatar
Explorer

Thanks for the help SAMSAL and my next task is to insert these values into the table. Please let me know which processor would help me. 

avatar

there are a lot of processors that can help you with that for example : PUTSQL, PutDatabaseRecord, ExecuteSQL

avatar
Explorer

Sure SAMSAL, Once the table is created I would test the same with above mentioned. Which is HiveDB Table.

avatar
Explorer

The table has been created, I have to perform Insert Statement on PutHiveStreaming Processor, Insert those Extracted Error Log Data. I am viewing results on Google and If I need any Help will get back here for sure, Thanks

avatar
Explorer

Please help me in Inserting these data into Hive DB. (Process flow). Hive doesn't accept CSV and it has to convert to Avro. I am stuck on how to Put Insert Statement and direct to PutHiveStreaming.