<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: oozie sqoop action hangs at 95% in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228871#M67333</link>
    <description>&lt;P&gt;You are welcome, glad you've got it sorted.&lt;/P&gt;</description>
    <pubDate>Fri, 01 Sep 2017 03:17:32 GMT</pubDate>
    <dc:creator>antin_leszczysz</dc:creator>
    <dc:date>2017-09-01T03:17:32Z</dc:date>
    <item>
      <title>oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228860#M67322</link>
      <description>&lt;P&gt;I have a sqoop import that works fine from the command line:&lt;/P&gt;&lt;P&gt;~$ sqoop import --connect "jdbc:sqlserver://10.100.197.46:1433;database=rtoISONE" --username hadoop --password XXXXXX --hive-import --hive-database perl3 --hive-overwrite -m 1 --table MaxIndex&lt;/P&gt;&lt;P&gt;But when I run it in an Oozie workflow, it never leaves the RUNNING phase, and when I look at it in YARN it sits at 95%. I know that Oozie itself is set up correctly, because a shell script run under it completes without a problem.&lt;/P&gt;&lt;P&gt;workflow.xml&lt;/P&gt;&lt;PRE&gt;&amp;lt;?xml version="1.0" encoding="UTF-8" standalone="no"?&amp;gt;
&amp;lt;workflow-app xmlns="uri:oozie:workflow:0.5" name="sqoop-wf"&amp;gt;  
  &amp;lt;global/&amp;gt;  
  &amp;lt;start to="sqoop"/&amp;gt; 
  &amp;lt;action name="sqoop"&amp;gt;  
    &amp;lt;sqoop xmlns="uri:oozie:sqoop-action:0.3"&amp;gt;  
      &amp;lt;job-tracker&amp;gt;${resourceManager}&amp;lt;/job-tracker&amp;gt;  
      &amp;lt;name-node&amp;gt;${nameNode}&amp;lt;/name-node&amp;gt;
      &amp;lt;command&amp;gt;${command}&amp;lt;/command&amp;gt;  
    &amp;lt;/sqoop&amp;gt;  
    &amp;lt;ok to="end"/&amp;gt; 
    &amp;lt;error to="kill"/&amp;gt;
  &amp;lt;/action&amp;gt;  
  &amp;lt;kill name="kill"&amp;gt;  
    &amp;lt;message&amp;gt;${wf:errorMessage(wf:lastErrorNode())}&amp;lt;/message&amp;gt;  
  &amp;lt;/kill&amp;gt;  
  &amp;lt;end name="end"/&amp;gt;
&amp;lt;/workflow-app&amp;gt;&lt;/PRE&gt;&lt;P&gt;job.properties&lt;/P&gt;&lt;PRE&gt;nameNode=hdfs://hadoopctrl:8020
resourceManager=hadoopctrl:8050
queueName=default
oozie.use.system.libpath=true
oozie.action.sharelib.for.sqoop=sqoop,hive,hcatalog
oozie.wf.application.path=${nameNode}/user/${user.name}
command=import --connect "jdbc:sqlserver://10.100.197.46:1433;database=rtoISONE" --username hadoop --password XXXXXX --hive-import --hive-database perl3 --hive-overwrite -m 1 --table MaxIndex&lt;/PRE&gt;&lt;P&gt;I have my vcores set to 10.&lt;/P&gt;&lt;P&gt;I have tried adding various properties to my workflow:&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt; 
  &amp;lt;name&amp;gt;mapred.reduce.tasks&amp;lt;/name&amp;gt;  
  &amp;lt;value&amp;gt;-1&amp;lt;/value&amp;gt;  
&amp;lt;/property&amp;gt;  
&amp;lt;property&amp;gt;  
  &amp;lt;name&amp;gt;mapreduce.job.reduces&amp;lt;/name&amp;gt;  
  &amp;lt;value&amp;gt;1&amp;lt;/value&amp;gt;  
&amp;lt;/property&amp;gt;  
&amp;lt;property&amp;gt;  
  &amp;lt;name&amp;gt;mapreduce.job.queuename&amp;lt;/name&amp;gt;  
  &amp;lt;value&amp;gt;launcher2&amp;lt;/value&amp;gt;  
&amp;lt;/property&amp;gt;  
&amp;lt;property&amp;gt;  
  &amp;lt;name&amp;gt;mapred.compress.map.output&amp;lt;/name&amp;gt;  
  &amp;lt;value&amp;gt;true&amp;lt;/value&amp;gt;  
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;Any ideas anyone has would be much appreciated.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Sat, 26 Aug 2017 02:54:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228860#M67322</guid>
      <dc:creator>joel_carver</dc:creator>
      <dc:date>2017-08-26T02:54:00Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228861#M67323</link>
      <description>&lt;P&gt;What do the yarn logs say? Could you post them?&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 00:42:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228861#M67323</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-08-29T00:42:58Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228862#M67324</link>
      <description>&lt;P&gt;Thanks for taking the time to respond (I'm a bit of a Hadoop noob).&lt;/P&gt;&lt;P&gt;I produced this log by running the command&lt;/P&gt;&lt;P&gt;~$ yarn job -list&lt;/P&gt;&lt;P&gt;taking the ID I found there, and running this command:&lt;/P&gt;&lt;P&gt;~$ yarn logs -applicationId application_1503692663931_0004 &amp;gt; wflog.log&lt;/P&gt;&lt;P&gt;The resulting file was around 5000 lines, which seems excessive to me. If there is a better way, please let me know. I attached the log file; I trimmed the LogType:directory.info sections to get the file size down.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 02:23:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228862#M67324</guid>
      <dc:creator>joel_carver</dc:creator>
      <dc:date>2017-08-29T02:23:17Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228863#M67325</link>
      <description>&lt;P&gt;From the log, it seems that your sqoop job is stuck in a "heart beat, heart beat..." loop.&lt;/P&gt;&lt;P&gt;This is a common symptom when something has gone wrong. Try searching for 'oozie sqoop import heart beat'.&lt;/P&gt;&lt;P&gt;I believe it is potentially a permissions issue, since the job got as far as 95%.&lt;/P&gt;&lt;P&gt;I suspect that when you run the sqoop job manually, you run it as the 'hdfs' user. Can you confirm this? The logs mention both&lt;/P&gt;&lt;PRE&gt;USER="hdfs"&lt;/PRE&gt;&lt;P&gt;and&lt;/P&gt;&lt;PRE&gt;realUser=oozie&lt;/PRE&gt;&lt;P&gt;so I suspect the 'oozie' user does not have permission to overwrite the table.&lt;/P&gt;&lt;P&gt;Check the permissions on the table. As a diagnostic step, try changing its permissions or ownership and running the job again.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 03:15:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228863#M67325</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-08-29T03:15:13Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228864#M67326</link>
      <description>&lt;P&gt;How are you getting on? Any luck with this?&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 15:46:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228864#M67326</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-08-29T15:46:53Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228865#M67327</link>
      <description>&lt;P&gt;No luck yet.&lt;BR /&gt;You are correct that I am running the sqoop import from the command line as the hdfs user.&lt;BR /&gt;Thanks for pointing out the realUser=oozie setting in the logs. When I searched the logs I always looked for "user", and it always said hdfs.&lt;/P&gt;&lt;P&gt;The first thing I tried was:&lt;/P&gt;&lt;P&gt;hadoop fs -chmod -R 777 /apps/hive/warehouse/perl3.db&lt;/P&gt;&lt;P&gt;I also tried adding these properties to my workflow to force Oozie to execute as the hdfs user (hadoopUser was set to hdfs in the job file):&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;HADOOP_USER_NAME&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;${hadoopUser}&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;hadoop.proxyuser.oozie.hosts&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;hadoopctrl,hadoopm1,hadoopm2,hadoopm3&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;
&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;hadoop.proxyuser.oozie.groups&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;hdfs&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;Same result: the job in YARN is still stuck at 95%.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 22:42:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228865#M67327</guid>
      <dc:creator>joel_carver</dc:creator>
      <dc:date>2017-08-29T22:42:32Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228866#M67328</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/43179/joelcarver.html" nodeid="43179"&gt;@Joel Carver&lt;/A&gt;&lt;P&gt; Please review my tutorial for the caveats of setting up a sqoop action. For example, starting with HDP 2.4 or 2.5 (I forget which), you need tez-site.xml in your lib directory. &lt;A href="https://community.hortonworks.com/articles/84394/apache-ambari-workflow-manager-view-for-apache-ooz-3.html" target="_blank"&gt;https://community.hortonworks.com/articles/84394/apache-ambari-workflow-manager-view-for-apache-ooz-3.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Aug 2017 23:33:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228866#M67328</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2017-08-29T23:33:13Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228867#M67329</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Thanks for pointing me to your article. It had some good info in it, but still no luck for me. After I uploaded the files and added them to the workflow.xml, the only difference I see is that I am using the SQL Server JDBC driver.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Aug 2017 03:56:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228867#M67329</guid>
      <dc:creator>joel_carver</dc:creator>
      <dc:date>2017-08-30T03:56:36Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228868#M67330</link>
      <description>&lt;P&gt;Since you ran hadoop fs -chmod -R 777 on your Hive table directory, we can probably rule out permission issues.&lt;/P&gt;&lt;P&gt;This is a great puzzle.&lt;/P&gt;&lt;P&gt;Any such problem should have shown up in the logs, but is there anything strange about your data? Nulls, NAs, empty values, strange date formats, decimals, special characters?&lt;/P&gt;&lt;P&gt;Did anything in &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt;'s post help?&lt;/P&gt;</description>
      <pubDate>Wed, 30 Aug 2017 04:07:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228868#M67330</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-08-30T04:07:54Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228869#M67331</link>
      <description>&lt;P&gt;Also, from your log and post, hadoopctrl is the NameNode, ResourceManager, and Oozie server. Is it also a DataNode and NodeManager? It may be hitting a memory bottleneck: Oozie is trying to use memory, but YARN cannot allocate memory or write the data. Try moving your Oozie server to another node, or reduce or redistribute the memory allocation; Oozie usually doesn't need much. This would probably explain the heart beat issue.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Aug 2017 23:04:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228869#M67331</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-08-30T23:04:53Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228870#M67332</link>
      <description>&lt;P style="margin-left: 20px;"&gt;OK, we have resolved our issues. It was a combination of three things. &lt;A rel="user" href="https://community.cloudera.com/users/11636/antinleszczyszyn.html" nodeid="11636"&gt;@antin leszczyszyn&lt;/A&gt; and &lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; put me on the right road. I will document how we solved the issues in the hope that it helps someone else.&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;1. As Antin pointed out, we had a user issue: our group had installed Apache Ranger, which changed the Hadoop users and permissions.&lt;/P&gt;&lt;P&gt;2. As Artem pointed out in the link to his tutorial, we needed to create a lib folder in the folder that we run our workflow from, and add the jdbc.jar file, hive-site.xml, and tez-site.xml to it.&lt;/P&gt;&lt;P&gt;3. While troubleshooting this problem we had changed the scheduler to the fair scheduler. We changed it back to the capacity scheduler and changed maximum-am-resource-percent from 0.2 to 0.6.&lt;/P&gt;&lt;P&gt;Thanks for the help&lt;/P&gt;</description>
      <pubDate>Thu, 31 Aug 2017 19:25:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228870#M67332</guid>
      <dc:creator>joel_carver</dc:creator>
      <dc:date>2017-08-31T19:25:53Z</dc:date>
    </item>
    <item>
      <title>Re: oozie sqoop action hangs at 95%</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228871#M67333</link>
      <description>&lt;P&gt;You are welcome, glad you've got it sorted.&lt;/P&gt;</description>
      <pubDate>Fri, 01 Sep 2017 03:17:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/oozie-sqoop-action-hangs-at-95/m-p/228871#M67333</guid>
      <dc:creator>antin_leszczysz</dc:creator>
      <dc:date>2017-09-01T03:17:32Z</dc:date>
    </item>
  </channel>
</rss>

