09-22-2018 07:57 AM
I am using CDH 5.13 and I want to join my datasets so I created pairedRDD but when I see all the available methods for pair RDD I am unable to get joins methods. However when I try it in spark 2.x it works smoothly on pair RDDs as well. So why is it so ?? Is it a bug in this version of cloudera ?? Or how to fix it ?? Any help is appreciated coz I am practicing it on cloudera from exam perspective, so want to stick to it. Below is my pairRdd code along with available methods after creating it.
Currently incubating in Cloudera Labs:Envelope