Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant.

I need to create a Spark Streaming application that continuously monitors an HDFS path and reads only newly arriving CSV files, keeping the data in tabular (row and column) form.

1 ACCEPTED SOLUTION

Thanks, @Felix Albani. I found it. For Scala code, please see this GitHub link: https://github.com/phatak-dev/spark2.0-examples/blob/master/src/main/scala/com/madhukaraphatak/examp.... This is exactly what I needed.
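
For readers who cannot open the link, here is a minimal sketch of one way to read newly arriving CSV files as a table, using the Structured Streaming file source from Spark 2.x. It is not taken from the linked repository. The HDFS path, the column names and types, and the object name are all hypothetical placeholders.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types.{DoubleType, StringType, StructType}

// Hypothetical example object; submit with spark-submit so the master is supplied externally.
object CsvDirectoryStream {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("CsvDirectoryStream")
      .getOrCreate()

    // Streaming file sources need an explicit schema by default.
    // These columns are assumptions; replace them with the real CSV layout.
    val schema = new StructType()
      .add("id", StringType)
      .add("name", StringType)
      .add("amount", DoubleType)

    // Each file that appears in the directory is read once as new rows.
    val csvStream = spark.readStream
      .option("header", "true")       // assumes every file starts with a header row
      .schema(schema)
      .csv("hdfs:///data/incoming")   // hypothetical directory to monitor

    // Print each micro-batch of new rows to the console in tabular form.
    val query = csvStream.writeStream
      .format("console")
      .outputMode("append")
      .start()

    query.awaitTermination()
  }
}
```

The console sink is only for inspection. A real job would more likely write to a file or table sink and set a checkpoint location so that already processed files are remembered across restarts.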

3 REPLIES

@Sudharsan Ganeshkumar

Out of the box, Spark Streaming provides fileStream, which monitors a directory and processes new files as they appear. You can read more here:

https://spark.apache.org/docs/latest/streaming-programming-guide.html

HTH
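
As a rough illustration of the file stream approach, the sketch below uses textFileStream, the text-file variant of fileStream in the DStream API. The HDFS path, the 10-second batch interval, and the object name are assumptions made for the example, and the comma split is deliberately naive (it does not handle quoted fields).

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Hypothetical example object; submit with spark-submit so the master is supplied externally.
object TextFileStreamSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("TextFileStreamSketch")
    // The batch interval of 10 seconds is an arbitrary choice for this sketch.
    val ssc = new StreamingContext(conf, Seconds(10))

    // Picks up files that newly appear in the directory (hypothetical path).
    val lines = ssc.textFileStream("hdfs:///data/incoming")

    // Naive CSV parsing: split on commas, keep empty trailing fields, trim whitespace.
    val rows = lines.map(_.split(",", -1).map(_.trim))

    // Print a few parsed rows from each batch.
    rows.map(_.mkString(" | ")).print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Files should be moved or renamed into the monitored directory atomically, since a file that is still being written when it is first seen may be read only partially.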

*** If this answer addressed your question, please take a moment to log in and click the "accept" link on the answer.

@Sudharsan Ganeshkumar, if my answer has helped you, please remember to log in and mark it as accepted.