Support Questions

Find answers, ask questions, and share your expertise

Sqoop Merge in Hive External table with date partitions

avatar

Problem Description : Sqoop is pushing data daily from MSSQL to HDFS location in date format like year/month/day (one pull daily for delta of data) and on top of it, created external Hive table. But Past data at source RDBMS is getting updated frequently so sqoop has to pull and merge those changes with already pulled data in HDFS. With that, I want to delete data that is 60 days old. Is there any way to achieve it. Can partitions in Hive resolve it. 

1 ACCEPTED SOLUTION

avatar
Expert Contributor
1 REPLY 1

avatar
Expert Contributor