Machine learning on large datasets

Rising Star

I am an experienced Java programmer and want to move into data science. I would like to apply machine learning algorithms to very large datasets (a few TB to PB). Which language is preferable: Scala, Python, or R?


Rising Star

I recommend Scala or Python.

R on its own only runs standalone (on a single machine), so at this scale you would have to combine it with something else: Spark + R or Python + R.
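
To illustrate why a standalone tool is limiting at TB to PB scale, here is a rough sketch of the pattern that cluster frameworks such as Spark rely on: each partition of the data produces a small summary, and only the summaries are combined. This is plain Python, not Spark code. The function names, the toy data, and the learning-rate settings are all made up for illustration, and a real job would use the framework's own ML library instead.

```python
# Illustrative only: plain Python standing in for a cluster framework.
# All names and data below are hypothetical.
from functools import reduce


def partition_gradient(rows, w, b):
    """Sum the squared-error gradient over one partition of (x, y) rows."""
    gw = gb = 0.0
    for x, y in rows:
        err = (w * x + b) - y
        gw += err * x
        gb += err
    return gw, gb, len(rows)


def combine(left, right):
    """Merge two partial results; associative, so merge order does not matter."""
    return left[0] + right[0], left[1] + right[1], left[2] + right[2]


def fit(partitions, steps=2000, lr=0.1):
    """Gradient descent for y = w*x + b using only per-partition summaries."""
    w = b = 0.0
    for _ in range(steps):
        # "map": every partition is processed independently
        partials = [partition_gradient(p, w, b) for p in partitions]
        # "reduce": only three numbers per partition are combined
        gw, gb, n = reduce(combine, partials)
        w -= lr * gw / n
        b -= lr * gb / n
    return w, b


# Toy data following y = 2x + 1, split into three partitions.
data = [(i / 10.0, 2.0 * (i / 10.0) + 1.0) for i in range(30)]
partitions = [data[0:10], data[10:20], data[20:30]]
w, b = fit(partitions)
print(w, b)
```

On a cluster, each partition would live on a different machine and the "map" step would run in parallel. In Scala or Python you would normally get this from Spark's libraries rather than writing it by hand.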
