<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: why do I get error during sqoop import every time i will not set the mapper to 1, in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191969#M76290</link>
    <description>&lt;P&gt;Also, can you please share the "actual" MR job logs that you can see when you are running your job with multiple mappers?&lt;/P&gt;</description>
    <pubDate>Fri, 23 Mar 2018 11:34:08 GMT</pubDate>
    <dc:creator>RahulSoni</dc:creator>
    <dc:date>2018-03-23T11:34:08Z</dc:date>
    <item>
      <title>why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191964#M76285</link>
      <description>&lt;P&gt;I'm importing 5.6 GB table&lt;/P&gt;&lt;P&gt;and the error:&lt;/P&gt;&lt;P&gt;Error: java.io.IOException: mysqldump terminated with status 2
at org.apache.sqoop.mapreduce.MySQLDumpMapper.map(MySQLDumpMapper.java:485)
at org.apache.sqoop.mapreduce.MySQLDumpMapper.map(MySQLDumpMapper.java:49)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:146)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:170)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1866)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:164)&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 13:01:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191964#M76285</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2022-09-16T13:01:02Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191965#M76286</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/66186/dcedielunesa.html" nodeid="66186"&gt;@Christian Lunesa&lt;/A&gt;&lt;P&gt;Can you please share your sqoop command? Are you using --direct by any chance?&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 10:36:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191965#M76286</guid>
      <dc:creator>RahulSoni</dc:creator>
      <dc:date>2018-03-23T10:36:39Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191966#M76287</link>
      <description>&lt;P&gt;import -Dorg.apache.sqoop.splitter.allow_text_splitter=true --connect jdbc:mysql://x.x.x.x:xxxx/kpcustomers --username root --password ******* --table customers --fields-terminated-by | -m 5 --hive-import --hive-overwrite --hive-table testing.customers --direct --verbose&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:08:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191966#M76287</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-23T11:08:35Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191967#M76288</link>
      <description>&lt;P&gt;Are you using "split-by" column while not setting number of mappers to 1?&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:22:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191967#M76288</guid>
      <dc:creator>RahulSoni</dc:creator>
      <dc:date>2018-03-23T11:22:17Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191968#M76289</link>
      <description>&lt;P&gt;yes I am using --direct&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:26:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191968#M76289</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-23T11:26:41Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191969#M76290</link>
      <description>&lt;P&gt;Also, can you please share the "actual" MR job logs that you can see when you are running your job with multiple mappers?&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:34:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191969#M76290</guid>
      <dc:creator>RahulSoni</dc:creator>
      <dc:date>2018-03-23T11:34:08Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191970#M76291</link>
      <description>&lt;P&gt;NO, I don't use split-by when I set mappers to 1 or in any&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:46:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191970#M76291</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-23T11:46:47Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191971#M76292</link>
      <description>&lt;P&gt;here's the stderr and stdout&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/64773-stdout.txt"&gt;stdout.txt&lt;/A&gt; &amp;amp; &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/64772-stderr.txt"&gt;stderr.txt&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:54:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191971#M76292</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-23T11:54:04Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191972#M76293</link>
      <description>&lt;P&gt;The import will succeed if I will set the mappers to 1. I also notice that when I'm not using 1 mapper the yarn memory will be fully consumed&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 11:57:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191972#M76293</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-23T11:57:03Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191973#M76294</link>
      <description>&lt;P&gt;So the issue is with the "PK" column used in distributing the data in case of multiple mappers. It has always been recommended that an "integral" column is used as the "split by" column and your import is trying to use the column "CustID" which is String. &lt;/P&gt;&lt;P&gt;Have a look at how your splits are calculated during the import.&lt;/P&gt;&lt;PRE&gt;8020 [main] WARN  org.apache.sqoop.mapreduce.db.TextSplitter  - You are strongly encouraged to choose an integral split column.
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '1'' and upper bound '`CustID` &amp;lt; '3&amp;#14;?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '1'' and upper bound '`CustID` &amp;lt; '3&amp;#14;?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '3&amp;#14;?????'' and upper bound '`CustID` &amp;lt; '5&amp;#28;?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '3&amp;#14;?????'' and upper bound '`CustID` &amp;lt; '5&amp;#28;?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '5&amp;#28;?????'' and upper bound '`CustID` &amp;lt; '7*?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '5&amp;#28;?????'' and upper bound '`CustID` &amp;lt; '7*?????''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '7*?????'' and upper bound '`CustID` &amp;lt;= '999999''
8025 [main] DEBUG org.apache.sqoop.mapreduce.db.DataDrivenDBInputFormat  - Creating input split with lower bound '`CustID` &amp;gt;= '7*?????'' and upper bound '`CustID` &amp;lt;= '999999''
8068 [main] INFO  org.apache.hadoop.mapreduce.JobSubmitter  - number of splits:4&lt;/PRE&gt;&lt;P&gt;The "?" indicates some foreign characters probably not parsed properly and hence resulted in the failure of your tasks.&lt;/P&gt;&lt;P&gt;However, when you have only a single mapper, there is no such parse needed for CustID column and the data is "copied and pasted" to HDFS and your job ends OK.&lt;/P&gt;</description>
      <pubDate>Fri, 23 Mar 2018 13:18:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191973#M76294</guid>
      <dc:creator>RahulSoni</dc:creator>
      <dc:date>2018-03-23T13:18:28Z</dc:date>
    </item>
    <item>
      <title>Re: why do I get error during sqoop import every time i will not set the mapper to 1,</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191974#M76295</link>
      <description>&lt;P&gt;Okey, Thanks a lot for explaining it to me very well. . I really have a lot of things needed to learn for this job.&lt;/P&gt;</description>
      <pubDate>Sat, 24 Mar 2018 05:00:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/why-do-I-get-error-during-sqoop-import-every-time-i-will-not/m-p/191974#M76295</guid>
      <dc:creator>dcedielunesa</dc:creator>
      <dc:date>2018-03-24T05:00:34Z</dc:date>
    </item>
  </channel>
</rss>

