Load MYSQL table in to RDD

How can I load a complete MySQL table into an RDD using Spark?

1 ACCEPTED SOLUTION

Re: Load MYSQL table in to RDD

Spark has a JdbcRDD class (org.apache.spark.rdd.JdbcRDD). Its constructor is:

new JdbcRDD(sc: SparkContext, getConnection: () ⇒ Connection, sql: String, lowerBound: Long, upperBound: Long, numPartitions: Int, mapRow: (ResultSet) ⇒ T = JdbcRDD.resultSetToObjectArray)(implicit arg0: ClassTag[T])
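
To make the constructor a bit more concrete, here is a minimal sketch of how it might be called against MySQL. The JDBC URL, credentials, table name (employees), column names (id, name), the bounds and the boundedQuery helper are all made up for illustration. The one thing JdbcRDD itself requires is that the query contains exactly two ? placeholders, which Spark fills with each partition's lower and upper bound. The sketch also assumes the MySQL JDBC driver is on the classpath and that the job is launched with spark-submit.

```scala
import java.sql.{DriverManager, ResultSet}
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.JdbcRDD

object LoadMySqlTable {
  // Hypothetical helper: builds a query with the two '?' placeholders
  // that JdbcRDD binds to the bounds of each partition.
  def boundedQuery(table: String, keyColumn: String): String =
    s"SELECT * FROM $table WHERE $keyColumn >= ? AND $keyColumn <= ?"

  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("LoadMySqlTable"))

    // Assumed connection details, replace with your own.
    val url = "jdbc:mysql://localhost:3306/mydb"
    val user = "spark"
    val password = "secret"

    val rows = new JdbcRDD(
      sc,
      () => DriverManager.getConnection(url, user, password),
      boundedQuery("employees", "id"),
      lowerBound = 1L,       // assumed smallest id in the table
      upperBound = 100000L,  // assumed largest id in the table
      numPartitions = 4,
      // Copy the values out of the ResultSet for each row.
      mapRow = (rs: ResultSet) => (rs.getLong("id"), rs.getString("name"))
    )

    println(rows.count())
    sc.stop()
  }
}
```

To pull in the complete table, lowerBound and upperBound need to cover the full range of the numeric key column, for example the result of a SELECT MIN(id), MAX(id) query run beforehand.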

2 REPLIES

Re: Load MYSQL table in to RDD

I'm not aware of a direct connector to MySQL. You could use Sqoop to ingest the contents of your table into HDFS, then use the SparkContext's textFile() method to load it as an RDD.
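
As a rough sketch of the second half of that approach, the snippet below reads a Sqoop import directory with textFile() and parses each line. It assumes the import was written as comma-delimited text and that each row has just two columns; the HDFS path, the Employee case class and the parseLine helper are hypothetical names used only for this example.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import scala.util.Try

object LoadSqoopOutput {
  // Hypothetical record type for a two-column table.
  final case class Employee(id: Long, name: String)

  // Parse one comma-delimited line; malformed lines yield None.
  def parseLine(line: String): Option[Employee] =
    line.split(",", -1) match {
      case Array(id, name) => Try(Employee(id.trim.toLong, name)).toOption
      case _               => None
    }

  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("LoadSqoopOutput"))

    // Assumed target directory of the Sqoop import.
    val lines = sc.textFile("hdfs:///user/spark/employees")

    // Keep only the lines that parse cleanly.
    val employees = lines.flatMap(line => parseLine(line).toList)

    println(employees.count())
    sc.stop()
  }
}
```

The plain split on commas breaks if a field itself contains a comma, so for real data you would want a delimiter that cannot appear in the values or a proper CSV parser.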
