Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

How to Skip header row using Pig

avatar
Expert Contributor

Hi All,

I have requirement where I need to skip the file header on each load using pig.

Is there any way to skip the header row while processing? apart from using RANK?

Thanks,

Satish.

1 ACCEPTED SOLUTION

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

View solution in original post

2 REPLIES 2

avatar
Super Guru
@Satish S

What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.

A = LOAD 'file_name' as (line:chararray);

B = FILTER A by $0>1;

See the following link for more options on this.

https://community.hortonworks.com/questions/74738/pig-error-error-orgapachepigtoolsgruntgrunt-error....

avatar
Expert Contributor

Thanks for these alernative methods to achive the result.