Support Questions
Find answers, ask questions, and share your expertise
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Fail the pipeline on any bad record

Fail the pipeline on any bad record


I am getting multiple CSV files as input and in that there is a date column(Joining_Date) in all the files. Requirement is to validate the date as check if the date is in proper format. Pass the files if the records are valid and fail the whole pipeline if any of the record is abd in any of the file.


Re: Fail the pipeline on any bad record

Super Guru
@rajat puchnanda

Use Validate Record processor to check the format and configure the processor with avro logical type for date and timestamp.

Use only the valid relation to further processing and invalid relation to fail the pipeline.

Refer to these links for validate record, avro logical type and avro logical types

Don't have an account?
Coming from Hortonworks? Activate your account here