Has Cloudera given up on Pig?

New Contributor

Many of us Cloudera Enterprise users love Apache Pig because it makes it easy to build powerful and complex data transformation and integration pipelines that run as fault-tolerant batch jobs.  However, we are afraid that Cloudera may be abandoning our beloved Pig because CDH has not kept up with any of the latest versions of Pig, which offer many new helpful capabilities.  What can you tell us about Cloudera's plans for Pig?

 

Hortonworks HDP 2.5 contains Pig 0.16
MapR 5.2.0 contains Pig 0.15
IBM BigInsights 4.2.0 contains Pig 0.15
Cloudera CDH 5.8.0 contains Pig 0.12

1 ACCEPTED SOLUTION

Cloudera Employee

Hi,

 

My name is Santosh and I am the Product Manager for Pig at Cloudera.

Cloudera is _not_ abandoning Pig at all. Pig is widely used among our customers. We are fully committed to supporting it.


Upstream Pig version numbers are not the best way to think about CDH Pig. Pig in CDH 5.8 is based on v0.12 plus numerous features, enhancements, and fixes, including some from v0.16, provided they pass our stability and quality standards. That's because our customers have consistently told us that stability and quality are their highest priority when it comes to Pig.


We have not rebased since v0.12 because v0.13 added support for a pluggable compute engine (Tez), which caused significant code churn and introduced instability. The next two releases, v0.14 and v0.15, mostly stabilized Pig on Tez and added very few features or enhancements. More recently, v0.16 was released. It is a good candidate for a rebase, and work has already started on rebasing CDH Pig to v0.16.


To reiterate, Cloudera is fully committed to supporting Pig and to making the most stable and reliable release of Pig available to our customers.


6 REPLIES

Contributor

Hi Humana,

Rebasing to a higher version of Pig is on our roadmap. Unfortunately, I can't provide specific guidance on which release will include it or when that will happen, but work is actively in progress. Rest assured that we haven't given up on Pig!

 

Thanks,
MJ

New Contributor

Santosh, thank you kindly for your reply.  I am sure that it is a relief for many of us to hear your good news.

 

Since you are the Product Manager for Pig at Cloudera, can you also please give us an update on your team's effort to make Pig run on Spark instead of MapReduce?

Explorer
Could you please tell us when you plan to update Pig from 0.12 to 0.16 in CDH? Thanks.

Explorer
And nothing has changed here :( Pig has already released version 0.17 (https://pig.apache.org/releases.html), but in CDH we still have 0.12, which dates from 2013!

Contributor

Some time has passed, but for those who subsequently find this thread: Pig 0.17.0 has been included in CDH 6:

https://www.cloudera.com/documentation/enterprise/6/release-notes/topics/rg_cdh_60_packaging.html#cd...