Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Multiple tables export with Sqoop in a Oozie workflow

avatar

Hi,

What is the best way to have an Oozie action performing a multiple tables export using Sqoop?

Context:

  • I have a HDFS directory containing one file per destination table but the number of files is not constant over time
  • The filename allows me to know what is the name of the target table

I'd like to have one single Oozie action calling Sqoop to export all the files in the target database.

Is it something we can do? What are the available options?

1 ACCEPTED SOLUTION

avatar
Master Guru

Ah just read that its an export. But again I don't think there is any automated way to do that.

So I suppose your only choice is to make a custom shell/ssh action that runs a script ( shell, python ... ) that lists the files from the directory and then executes sqoop jobs for them. Would be my approach.

View solution in original post

2 REPLIES 2

avatar
Master Guru

Ah just read that its an export. But again I don't think there is any automated way to do that.

So I suppose your only choice is to make a custom shell/ssh action that runs a script ( shell, python ... ) that lists the files from the directory and then executes sqoop jobs for them. Would be my approach.

avatar

Hi @Benjamin Leonhardi, thanks for your answer, it confirms what I read so far. I'll go that way, thanks again.