Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

Multiple tables export with Sqoop in a Oozie workflow

avatar

Hi,

What is the best way to have an Oozie action performing a multiple tables export using Sqoop?

Context:

  • I have a HDFS directory containing one file per destination table but the number of files is not constant over time
  • The filename allows me to know what is the name of the target table

I'd like to have one single Oozie action calling Sqoop to export all the files in the target database.

Is it something we can do? What are the available options?

1 ACCEPTED SOLUTION

avatar
Master Guru

Ah just read that its an export. But again I don't think there is any automated way to do that.

So I suppose your only choice is to make a custom shell/ssh action that runs a script ( shell, python ... ) that lists the files from the directory and then executes sqoop jobs for them. Would be my approach.

View solution in original post

2 REPLIES 2

avatar
Master Guru

Ah just read that its an export. But again I don't think there is any automated way to do that.

So I suppose your only choice is to make a custom shell/ssh action that runs a script ( shell, python ... ) that lists the files from the directory and then executes sqoop jobs for them. Would be my approach.

avatar

Hi @Benjamin Leonhardi, thanks for your answer, it confirms what I read so far. I'll go that way, thanks again.