Support Questions

Find answers, ask questions, and share your expertise

How to Skip header row using Pig

avatar
Expert Contributor

Hi All,

I have requirement where I need to skip the file header on each load using pig.

Is there any way to skip the header row while processing? apart from using RANK?

Thanks,

Satish.

1 ACCEPTED SOLUTION

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

View solution in original post

2 REPLIES 2

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

avatar
Expert Contributor

Thanks for these alernative methods to achive the result.