09-03-2014 07:00 AM - last edited on 09-03-2014 09:44 AM by jkestelyn
I need to extract data from an Oracle DB table that's partitioned by hour. I have set up a Sqoop2 connection and can run the job manually. What is the best way to automate the extract so it runs at certain intervals?
Thanks in advance,
09-04-2014 06:40 AM
I suggest looking at Oozie. Oozie is the de facto workflow manager/scheduler and is supported on CDH. You can schedule jobs to run at fixed intervals, at specific dates and times, etc.
Oozie supports Sqoop jobs. Here is some documentation, although I have no idea how up-to-date it is :)
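Whichever scheduler you end up with, each run needs to know which hourly partition to pull. Here is a minimal Python sketch of that calculation, assuming you want the last complete hour before the scheduled run time. The function name `previous_hour_window` is made up for illustration and is not part of Oozie or Sqoop; how you pass the resulting bounds to your job depends on your setup.

```python
from datetime import datetime, timedelta


def previous_hour_window(run_time):
    """Return (start, end) of the last complete hour before run_time.

    Hypothetical helper: start is inclusive, end is exclusive.
    """
    # Truncate the run time to the top of its hour; that is the window end.
    end = run_time.replace(minute=0, second=0, microsecond=0)
    # The window covers the full hour before that point.
    start = end - timedelta(hours=1)
    return start, end


if __name__ == "__main__":
    # Example: a run triggered at 07:05 extracts the 06:00-07:00 partition.
    start, end = previous_hour_window(datetime(2014, 9, 4, 7, 5))
    print(start.isoformat(), end.isoformat())
```

Using a half-open window (start inclusive, end exclusive) means consecutive hourly runs neither overlap nor skip rows.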
Hope that helps!