Created on 10-21-2020 01:11 AM - edited 10-21-2020 01:53 AM
Problem Description : Sqoop is pushing data daily from MSSQL to HDFS location in date format like year/month/day (one pull daily for delta of data) and on top of it, created external Hive table. But Past data at source RDBMS is getting updated frequently so sqoop has to pull and merge those changes with already pulled data in HDFS. With that, I want to delete data that is 60 days old. Is there any way to achieve it. Can partitions in Hive resolve it.
Created 10-22-2020 07:16 AM
Here's a relevant community answer...
https://community.cloudera.com/t5/Support-Questions/How-to-use-merge-in-sqoop-import/td-p/161847
Mike
Created 10-22-2020 07:16 AM
Here's a relevant community answer...
https://community.cloudera.com/t5/Support-Questions/How-to-use-merge-in-sqoop-import/td-p/161847
Mike