Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

How to extract two fields (more than one) in a Pig nested foreach

Expert Contributor

Hi All,

According to my requirement, I need a script like the following:

A = load '/bsuresh/sample' USING PigStorage(',') as (id,name,sal,deptid);
B = GROUP A by deptid;
C = foreach B {
    D = A.name,A.sal; -- two fields
    E = DISTINCT D;
    generate group,COUNT(E);
};

In relation 'D' I am extracting two fields, and that is exactly where I get the error.

If I change the script as follows, it works fine:

C = foreach B {
    D = A.name; -- one field
    E = DISTINCT D;
    generate group,COUNT(E);
};

But I need the count based on the distinct combination of two columns. Can anyone help me?

3 REPLIES

Master Mentor

@Suresh Bonam have you looked at CONCAT?
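
To make the suggestion concrete, here is a minimal sketch of how CONCAT could be used, not a tested solution. The idea is to merge name and sal into one composite field before grouping, so the nested foreach only has to project a single field. The chararray types, the alias A2, the field name name_sal and the '|' delimiter are all assumptions for illustration; none of them appear in the original script.

```pig
-- Sketch only: the field types are assumed (the original LOAD declares none).
A = LOAD '/bsuresh/sample' USING PigStorage(',')
    AS (id:chararray, name:chararray, sal:chararray, deptid:chararray);

-- Build one composite key per row. The '|' delimiter is an assumption; pick a
-- character that never occurs in name or sal, so ('ab','c') and ('a','bc')
-- do not collapse into the same key.
A2 = FOREACH A GENERATE deptid, CONCAT(name, CONCAT('|', sal)) AS name_sal;

B = GROUP A2 BY deptid;
C = FOREACH B {
    D = A2.name_sal;   -- a single field again, as in the working script
    E = DISTINCT D;
    GENERATE group, COUNT(E);
};
```

An alternative that may avoid the extra step is to project both fields with parentheses inside the nested block, i.e. D = A.(name, sal); in the original script. That relies on Pig's multi-field bag projection syntax, so check it against your Pig version.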

Expert Contributor

Yeah, @Artem Ervits, I got your point. Simple but logical.

Master Mentor

@Suresh Bonam let me know if that works for you and close the thread :).