How to extract two fields (more than one) in a Pig nested foreach

Rising Star

Hi All,

For my requirement, I need a script like the following:

A = load '/bsuresh/sample' USING PigStorage(',') as (id,name,sal,deptid);
B = GROUP A by deptid;
C = foreach B {
        D = A.name,A.sal; -- two fields
        E = DISTINCT D;
        generate group,COUNT(E);
};

In relation 'D' I am extracting two fields, and that is exactly where I get an error.

If I change the script as follows, it works fine:

C = foreach B {
        D = A.name; -- one field
        E = DISTINCT D;
        generate group,COUNT(E);
};

But I need the count based on the distinct combination of two columns. Can anyone help?

Mentor

@Suresh Bonam have you looked at CONCAT?
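
A minimal sketch of how the CONCAT idea might be applied, not a confirmed solution from this thread. It assumes the fields are loaded as chararray (the original script declares no types) and uses a hypothetical '|' separator and a hypothetical alias name_sal, so that pairs such as ('ab','c') and ('a','bc') do not collapse into the same key:

```pig
-- Assumption: all fields are declared chararray so CONCAT can combine them.
A = LOAD '/bsuresh/sample' USING PigStorage(',')
    AS (id:chararray, name:chararray, sal:chararray, deptid:chararray);

-- Build one combined key per row before grouping.
-- '|' is an assumed separator; pick one that cannot occur in the data.
A1 = FOREACH A GENERATE deptid, CONCAT(name, CONCAT('|', sal)) AS name_sal;

B = GROUP A1 BY deptid;

C = FOREACH B {
        D = A1.name_sal;  -- a single field again, so DISTINCT works as before
        E = DISTINCT D;
        GENERATE group, COUNT(E);
};
```

Note that CONCAT returns null when either input is null, and COUNT skips nulls, so rows with a missing name or sal would not be counted. As an alternative that avoids the combined key, Pig's tuple-style projection A.(name, sal) may also work in place of A.name,A.sal inside the nested block; it is worth checking against your Pig version.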

Rising Star

Yeah, @Artem Ervits, I got your point. Simple but logical.

Mentor

@Suresh Bonam let me know if that works for you and close the thread :).
