Is there any case to prefer RDD instead of DataFrame/Dataset/SparkSQL in Spark 2.3.0 or later?

Is there any case where RDDs are preferable to DataFrame/Dataset/SparkSQL in Spark 2.3.0 or later?

AFAIK, the DataFrame/Dataset/SparkSQL APIs have a lot of merits, for example simpler code and query optimization by Catalyst.

If there is a concrete example that is better written with RDDs, please let me know!

Thanks,
