Created on 01-11-2017 05:39 AM - edited 09-16-2022 03:53 AM
Hi,
I was going through the practical exam for HDPCD in AWS. There is a question in pig as
"Remove all rows in the flightdelays data where the DepTime
column equals the string "NA".
The answer provided for this question says:
b = filter a by (chararray) $4 != 'NA' or (chararray) $11 != 'NA';
I do not understand why It checks 2 fields here. Is it correct or am I missing something ? Because of this I find differences in the actual output and my answer.
Please help me to understand if anyone has idea about it.
Created 01-23-2017 02:08 PM
There is a typo in the solution.
The correct line in the solution should be
b = filter a by (chararray) $4 != 'NA';
Thanks
Created 01-23-2017 02:08 PM
There is a typo in the solution.
The correct line in the solution should be
b = filter a by (chararray) $4 != 'NA';
Thanks
Created 01-24-2017 01:05 AM
Thank you!