Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

How to Skip header row using Pig

avatar
Expert Contributor

Hi All,

I have requirement where I need to skip the file header on each load using pig.

Is there any way to skip the header row while processing? apart from using RANK?

Thanks,

Satish.

1 ACCEPTED SOLUTION

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

View solution in original post

2 REPLIES 2

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

avatar
Expert Contributor

Thanks for these alernative methods to achive the result.