<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134428#M18881</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; table name was my typo error. Provided the name in query ... And query is not throwing any error while importing but imports more than 100 .
strictly speacking number of records imported to hbase is getting multiplied by the number of mappers allocated to the task .
For example : if top 100 are selected and 4 mappers are running for the task , 400 records are being imported to &lt;STRONG&gt;hbase &lt;/STRONG&gt;table.&lt;/P&gt;</description>
    <pubDate>Wed, 10 Feb 2016 17:24:12 GMT</pubDate>
    <dc:creator>klksrinivas</dc:creator>
    <dc:date>2016-02-10T17:24:12Z</dc:date>
    <item>
      <title>sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134426#M18879</link>
      <description>&lt;P&gt;sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times sqoop import --connect "jdbc:sqlserver://;database=;username=;password=" --query 'select top 100000 * from where $CONDITIONS' --split-by ID --hbase-table --column-family info --hbase-create-table -m 4 ---- this import query is importing 400000 in place of 10000&lt;/P&gt;</description>
      <pubDate>Wed, 10 Feb 2016 14:44:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134426#M18879</guid>
      <dc:creator>klksrinivas</dc:creator>
      <dc:date>2016-02-10T14:44:34Z</dc:date>
    </item>
    <item>
      <title>Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134427#M18880</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2508/klksrinivas.html" nodeid="2508"&gt;@Krishna Srinivas&lt;/A&gt;&lt;P&gt;i don't see table name after from was that intentional? &lt;/P&gt;&lt;PRE&gt;--single quotes, $CONDITIONS does not need to be escaped
--query ' select top 100 * from dbo.[Orders] where $CONDITIONS ' \ 
   --split-by callID -m 10&lt;/PRE&gt;&lt;P&gt;Here's a good resource &lt;A href="https://danieladeniji.wordpress.com/category/technical/hadoop/sqoop/" target="_blank"&gt;https://danieladeniji.wordpress.com/category/technical/hadoop/sqoop/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Feb 2016 16:27:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134427#M18880</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-10T16:27:51Z</dc:date>
    </item>
    <item>
      <title>Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134428#M18881</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; table name was my typo error. Provided the name in query ... And query is not throwing any error while importing but imports more than 100 .
strictly speacking number of records imported to hbase is getting multiplied by the number of mappers allocated to the task .
For example : if top 100 are selected and 4 mappers are running for the task , 400 records are being imported to &lt;STRONG&gt;hbase &lt;/STRONG&gt;table.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Feb 2016 17:24:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134428#M18881</guid>
      <dc:creator>klksrinivas</dc:creator>
      <dc:date>2016-02-10T17:24:12Z</dc:date>
    </item>
    <item>
      <title>Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134429#M18882</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2508/klksrinivas.html" nodeid="2508"&gt;@Krishna Srinivas&lt;/A&gt;&lt;P&gt;Please see this  &lt;A href="https://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html#_free_form_query_imports" target="_blank"&gt;https://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html#_free_form_query_imports&lt;/A&gt;&lt;/P&gt;&lt;P&gt;If you want to import the results of a query in parallel, then each map task will need to execute a copy of the query, with results partitioned by bounding conditions inferred by Sqoop. Your query must include the token &lt;CODE&gt;$CONDITIONS&lt;/CODE&gt; which each Sqoop process will replace with a unique condition expression. You must also select a splitting column with &lt;CODE&gt;--split-by&lt;/CODE&gt;.&lt;/P&gt;&lt;P&gt;Also, there is probability of experiencing weird results when run in parallel. I don't have jira detials&lt;/P&gt;</description>
      <pubDate>Wed, 10 Feb 2016 18:24:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134429#M18882</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-02-10T18:24:07Z</dc:date>
    </item>
    <item>
      <title>Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134430#M18883</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/140/nsabharwal.html" nodeid="140"&gt;@Neeraj Sabharwal&lt;/A&gt; So, I take it as a best practice to import using single mapper while using free form queries &lt;/P&gt;</description>
      <pubDate>Wed, 10 Feb 2016 19:37:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134430#M18883</guid>
      <dc:creator>klksrinivas</dc:creator>
      <dc:date>2016-02-10T19:37:18Z</dc:date>
    </item>
    <item>
      <title>Re: sqoop free form query import to hbase having issue ... while using multiple mappers its importing multiple times</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134431#M18884</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2508/klksrinivas.html" nodeid="2508"&gt;@Krishna Srinivas&lt;/A&gt;, Using multiple mappers is good practice also for free-form queries, however you have to keep in mind what's your free-form query doing. Each mapper will run a copy of the query with additional WHERE conditions to split the table based on the "--split-by" column. So, in your case each mapper will return 100k records per split, for the total of 400k. If you want 100k per table then you should set use "TOP 25000 ..." For the majority of free-form queries like "WHERE a&amp;gt;100 and b&amp;gt;300" you don't have to worry about the number of records.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Feb 2016 08:08:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/sqoop-free-form-query-import-to-hbase-having-issue-while/m-p/134431#M18884</guid>
      <dc:creator>pminovic</dc:creator>
      <dc:date>2016-02-11T08:08:22Z</dc:date>
    </item>
  </channel>
</rss>

