Created 05-13-2017 11:00 PM
Hi All,
I have a requirement to skip the file header on each load using Pig.
Is there any way to skip the header row while processing, apart from using RANK?
Thanks,
Satish.
Created 05-14-2017 02:17 AM
What's the file type? If it's CSV, you can use CSVExcelStorage from piggybank, which can skip the header row on load. Otherwise, the following should work.
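A = LOAD 'file_name' AS (line:chararray);
-- $0 holds the whole line as text, so it is compared against the header text rather than a number.
-- 'header_text' is a placeholder for the leading characters of your header row.
B = FILTER A BY NOT (line MATCHES 'header_text.*');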
See the following link for more options on this.
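For the CSV case, a minimal sketch of the CSVExcelStorage route might look like the following. It assumes the piggybank jar is available at the path shown (the path, the file name, and the two-column schema are placeholders, not taken from your setup) and a Pig version whose CSVExcelStorage accepts the header option as its fourth argument.

```pig
-- Path to piggybank.jar is an assumption; adjust it for your cluster.
REGISTER '/usr/lib/pig/piggybank.jar';

-- Arguments: delimiter, multiline handling, end-of-line style, header handling.
-- 'SKIP_INPUT_HEADER' asks the loader to drop the header row of the input.
A = LOAD 'file_name'
    USING org.apache.pig.piggybank.storage.CSVExcelStorage(',', 'NO_MULTILINE', 'UNIX', 'SKIP_INPUT_HEADER')
    AS (id:int, name:chararray);  -- hypothetical schema

DUMP A;
```

Since the header is dropped by the loader itself, no RANK or FILTER step is needed afterwards.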
Created 05-14-2017 01:12 PM
Thanks for these alternative methods to achieve the result.