Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Is there any case to prefer RDD instead of DataFrame/Dataset/SparkSQL in Spark 2.3.0 or later?

Highlighted

Is there any case to prefer RDD instead of DataFrame/Dataset/SparkSQL in Spark 2.3.0 or later?

Contributor

Is there any case to prefer RDD instead of DataFrame/Dataset/SparkSQLin Spark 2.3.0 or later?

AFAIK, using DataFrame/Dataset/SparkSQL has a lot of merits. For example, simpler coding and Optimization by Catalyst.

If there's some concrete example that should be written with RDD, let me know!

Thanks,