Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Handling nulls as alias at Pig level

Expert Contributor

Hi All,

I have a Hive table with the structure below.

CREATE TABLE hv (x int, y int, z int, w int);

Pig Latin used to load the table:

A = LOAD 'XYZ.CSV.GZ' USING PigStorage(',') AS (x, y, w);

This time the z column is missing from the file, so I want to keep the value of z as null in Pig Latin, and I do not want to change my Hive table structure. How can I generate null values for this column in Pig?

1 ACCEPTED SOLUTION

Super Collaborator

Hi @Vaibhav Kumar,

If you want to create a bag matching the target table's structure, you can do the following:

a = load 'file.csv' using PigStorage(',') as (x:int, y:int, w:int);
b = foreach a generate x, y, (int)null as z, w;
describe b;
--    b: {x: int,y: int,z: int,w: int}
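
To actually write the padded relation into the existing Hive table, something like the following may work. This is only a sketch: it assumes HCatalog is installed, that the script is started with the -useHCatalog option, and that hv lives in the default Hive database. None of that is stated in the question.

```pig
-- Assumed invocation: pig -useHCatalog script.pig
a = LOAD 'file.csv' USING PigStorage(',') AS (x:int, y:int, w:int);

-- Add a typed null for the missing z column, keeping the Hive column order x, y, z, w
b = FOREACH a GENERATE x, y, (int)null AS z, w;

-- Assumption: hv is the target table from the question, in the default database
STORE b INTO 'hv' USING org.apache.hive.hcatalog.pig.HCatStorer();
```

The cast on null matters here: it gives z an explicit int type, so the schema of b lines up with the Hive column definition.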
