03-24-2016 10:53 AM
I'm running into an issue with my RDD: I persist it and call count() to trigger the caching, but the entire RDD doesn't end up in memory until I have queried it several times. This makes the first few runs extremely slow. Has anyone run into this issue, and if so, how did you fix it?
03-24-2016 11:01 AM