Correct Way to Create Feature Lists from Input Stream

I want to know the right way to create a feature list for each element of an input stream. I have more than 100 features that I want to compute for each input record that we receive from the Kafka Spark stream. Because there are so many features, I want to compute them in parallel. However, I cannot use an RDD inside another RDD. Should I write my own multithreaded application to process the features?

Please guide.

Thanks,

Rachana
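
One pattern that may fit the setup described above, sketched under assumptions: rather than nesting RDD operations, keep every feature as a plain function and apply all of them to a record inside a single map step. Spark would then distribute the records across partitions, and each task would compute the full feature list for its own records. The record fields and feature names below are hypothetical, and the built-in `map` only stands in for a call such as `rdd.map(compute_features)` in PySpark.

```python
from typing import Callable, Dict

# Hypothetical record type: one parsed Kafka message as a dict of numbers.
Record = Dict[str, float]


# Hypothetical feature functions. Each takes one record and returns one value.
def amount_squared(rec: Record) -> float:
    return rec["amount"] ** 2


def amount_per_item(rec: Record) -> float:
    # Guard against division by zero when a record has no items.
    return rec["amount"] / rec["items"] if rec["items"] else 0.0


def is_large(rec: Record) -> float:
    return 1.0 if rec["amount"] > 100 else 0.0


# Registry of features; with 100+ features this dict would simply grow.
FEATURES: Dict[str, Callable[[Record], float]] = {
    "amount_squared": amount_squared,
    "amount_per_item": amount_per_item,
    "is_large": is_large,
}


def compute_features(rec: Record) -> Dict[str, float]:
    """Compute every registered feature for a single record."""
    return {name: fn(rec) for name, fn in FEATURES.items()}


if __name__ == "__main__":
    # Stand-in for one micro-batch of stream records (made-up values).
    batch = [
        {"amount": 10.0, "items": 2.0},
        {"amount": 200.0, "items": 0.0},
    ]
    # In Spark, this would be the function passed to the map step (assumption).
    for features in map(compute_features, batch):
        print(features)
```

Because `compute_features` touches only its own record, it needs no nested RDD and no hand-written threading; whether this is enough depends on how expensive the individual features are.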