- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
How to Skip header row using Pig
- Labels:
-
Apache Pig
Created ‎05-13-2017 11:00 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi All,
I have requirement where I need to skip the file header on each load using pig.
Is there any way to skip the header row while processing? apart from using RANK?
Thanks,
Satish.
Created ‎05-14-2017 02:17 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.
A = LOAD 'file_name' as (line:chararray);
B = FILTER A by $0>1;
See the following link for more options on this.
Created ‎05-14-2017 02:17 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
What's the file type? If its CSV, you can use CSVExcelStorage. Otherwise, the following should work.
A = LOAD 'file_name' as (line:chararray);
B = FILTER A by $0>1;
See the following link for more options on this.
Created ‎05-14-2017 01:12 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thanks for these alernative methods to achive the result.
