- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
CDP Public - install R and SparkR
- Labels:
-
Cloudera Data Platform (CDP)
Created 05-16-2022 03:09 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi,
is there any documentation to install R and SparkR in a gateway node of a DataHub?
I have CM 7.5.2 and a CDP Public subscription.
Spark version currently configured:
spark 2.4.8
spark 3.1.2
Thank you
Andrea
Created 05-17-2022 08:42 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Andrea,
There is not much different from how you would install R and SparkR on any other host. The steps to install would be the same. How that R runtime integrates with CDP DataHub would need to be configured after the installation. Instead of SparkR, I would actually recommend you use sparklyr, as it's more compatible with Spark.
Regards,
Alex
Created 05-17-2022 08:42 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Andrea,
There is not much different from how you would install R and SparkR on any other host. The steps to install would be the same. How that R runtime integrates with CDP DataHub would need to be configured after the installation. Instead of SparkR, I would actually recommend you use sparklyr, as it's more compatible with Spark.
Regards,
Alex
Created 05-19-2022 11:32 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi Alex,
thank you for having confirmed that. I'll proceed as you suggest.
Regards
Andrea
