Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Data transformations at Spark level

avatar
New Contributor

Hello,

 

I'm trying to work with Spark and Cassandra to extract data from the data lake and transform it. Transformation may be done before of after loading in Cassandra.

 

About transformations, I'm wondering: what would be the transformation tool that would allow me to do transformations without wondering of data storage? I mean that, if tomorrow I don't want to use Cassandra anymore but Hadoop, I would like that my transformations remain valid. So, I would like that my transformation tool works with Spark directly and that my transformation tool do not take care of the tool Spark works with.

 

Please, could you recommend me a tool that would work with Spark and do not take care of undernying tools like Cassandra et Hadoop?

 

Thanks 🙂

1 ACCEPTED SOLUTION

avatar
Super Collaborator

In Spark a transformation works directly on the RDD. Transforms are implemented lazely and closely coupled to the RDDs. You can not use them separately.

What you are looking for is a tool that can generate Saprk code for you based on the transformation rule. I don't think that something like that exists.

 

Wilfred

View solution in original post

3 REPLIES 3

avatar
Super Collaborator

Do these transformations not work for you? Anything that you write in Spark can be adjusted to work with different storage underneath.

What else would you be looking for.

 

Wilfred

avatar
New Contributor

Hello,

 

Thanks for your answer.

Spark allows me to do some transformations but it is not the main goal of Spark. A transformation tool would offer me more capabilities and would be more productive for tons of transformation rules to be produced, case by case.

avatar
Super Collaborator

In Spark a transformation works directly on the RDD. Transforms are implemented lazely and closely coupled to the RDDs. You can not use them separately.

What you are looking for is a tool that can generate Saprk code for you based on the transformation rule. I don't think that something like that exists.

 

Wilfred