Support Questions
Find answers, ask questions, and share your expertise

Sqoop Import job stuck at 99% when using JOIN in query

Sqoop Import job stuck at 99% when using JOIN in query

Rising Star

Here is the command that i use to Import data from RDS into S3 .

sqoop import \
-D mapreduce.map.memory.mb=6144 -D mapreduce.map.java.opts=-Xmx1024m \
--connect jdbc:mysql://a205067-pcfp-rds-abcd.dfgfdg.us-east-1.rds.amazonaws.com/tprdb \
--username tpruser \
--password Welcome12345 \
--query 'SELECT d.* from DnB_WB_UniverseMaster d join DnB_WB_UniverseMaster_Incr c on d.DunsNumber = c.DunsNumber where $CONDITIONS' \
--boundary-query "SELECT * FROM
    (
        SELECT
            MIN( DunsNumber ) min_
        from
            DnB_WB_UniverseMaster
    ) v1,
    (
        SELECT
            MAX( DunsNumber ) max_
        from
            DnB_WB_UniverseMaster
    ) v2" \
--split-by d.DunsNumber \
-num-mappers 100 \
--fields-terminated-by '|' \
--lines-terminated-by '\n' \
--target-dir s3://12345-pcfp-latest-new/output/processing/APCFP/IMPORT

When i run it last mapper is getting stuck at 99% and it does not move at all .

It always stuck at 99%.

Same thing when i do it for the single table then it works fine .

Can some one suggest the changes .