<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Pig View Will Not Run, but Grunt CLI Will in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148059#M110588</link>
    <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/40798/pig-view-error-stderr-file-does-not-exist.html#"&gt;@Colton Rodgers&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Can you provide your hadoop.proxyusers.* properties settings? &lt;/P&gt;</description>
    <pubDate>Tue, 21 Jun 2016 00:08:01 GMT</pubDate>
    <dc:creator>rpathak</dc:creator>
    <dc:date>2016-06-21T00:08:01Z</dc:date>
    <item>
      <title>Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148057#M110586</link>
      <description>&lt;P&gt;
 Hello All,
 &lt;/P&gt;&lt;P&gt; 
I will preface with, I have seen multiple questions of similar nature and have tried each of the solutions, but to no avail on my end and feel that a more in depth explanation may help others as well if they are to ever arrive at a similar issue.&lt;/P&gt;&lt;P&gt;The pig view fails, but the Grunt&amp;gt; CLI runs fine, so I am thinking that it may be a PIG View configuration error.&lt;/P&gt;&lt;P&gt;I started by researching the jira located at &lt;A href="https://issues.apache.org/jira/browse/AMBARI-12738" target="_blank" rel="nofollow noopener noreferrer"&gt;https://issues.apache.org/jira/browse/AMBARI-12738&lt;/A&gt;&lt;/P&gt;&lt;P&gt;
 I am trying to use the Pig View in Ambari 2.2.1 on HDP 2.4.2 and am running into a multitude of errors. &lt;/P&gt;&lt;P&gt; The script that I am running is&lt;/P&gt;&lt;PRE&gt;logs = LOAD 'server_logs.error_logs' USING org.apache.hive.hcatalog.pig.HCatLoader();
DUMP logs;
&lt;/PRE&gt;&lt;P&gt;
 The job will fail with a "Job failed to start" Error which then only has a stack trace of &lt;/P&gt;&lt;PRE&gt;java.net.SocketTimeoutException: Read timed out

java.net.SocketTimeoutException: Read timed out&lt;/PRE&gt;&lt;P&gt;In the history logs within the view I receive the following error only&lt;/P&gt;&lt;PRE&gt;File /user/admin/pig/jobs/errlogs_20-06-2016-15-11-39/stderr not found.&lt;/PRE&gt;&lt;P&gt;I have tried this for user hdfs and admin the same problem remains, I have also just tried to load a file with PigStorage('|'), but that also returned me the same issue.&lt;/P&gt;&lt;P&gt;Using both Tez and MR ExecTypes, I receive the same error. &lt;/P&gt;&lt;P&gt;The NameNode and ResourceManager are both in High Availability mode.&lt;/P&gt;&lt;P&gt;I have added the appropriate proxyuser configs to both the &lt;STRONG&gt;core-site&lt;/STRONG&gt; and &lt;STRONG&gt;hcat-site&lt;/STRONG&gt; in HDFS and Hive configurations.
I have restarted &lt;STRONG&gt;all services &lt;EM&gt; and &lt;/EM&gt;Ambari-Server&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;The stderr file &lt;STRONG&gt;&lt;EM&gt;is&lt;/EM&gt;&lt;/STRONG&gt; created within the &lt;/P&gt;&lt;PRE&gt;/user/admin/pig/jobs/errlogs_20-06-2016-15-11-39/ &lt;/PRE&gt;&lt;P&gt;directory, but does not have anything written to it. The admin/pig/ directory has full go+w 777 -R permissions, but when the stderr file is created it will only show as having 644 permissions.&lt;/P&gt;&lt;P&gt;Against my better judgement I issued an &lt;/P&gt;&lt;PRE&gt;hdfs dfs -chmod -R 777 /user&lt;/PRE&gt;&lt;P&gt;command to see if it was an underlying permissions issue on a file unbeknownst  to me, but that also left me with the same outcome.&lt;/P&gt;&lt;P&gt;The Resource Manager Logs show that the application is submitted and continues to hang in the RUNNING state even after the job has been noted as "Failed to Start" through Ambari.&lt;/P&gt;&lt;P&gt;yarn application -list shows that there are no running Applications as well.&lt;/P&gt;&lt;P&gt;Has anyone figured out a solution to this problem? The stacktraces are not helpful, given they do not output more than 1-2 lines of information. My Pig View Cluster configuration is as follows:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="5127-pig-config.png" style="width: 529px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21345i73F6D64A6A96FDA0/image-size/medium?v=v2&amp;amp;px=400" role="button" title="5127-pig-config.png" alt="5127-pig-config.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 13:15:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148057#M110586</guid>
      <dc:creator>colton_rodgers</dc:creator>
      <dc:date>2019-08-18T13:15:34Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148058#M110587</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt;&lt;P&gt;Do you have any ideas?&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jun 2016 23:49:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148058#M110587</guid>
      <dc:creator>colton_rodgers</dc:creator>
      <dc:date>2016-06-20T23:49:08Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148059#M110588</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/40798/pig-view-error-stderr-file-does-not-exist.html#"&gt;@Colton Rodgers&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Can you provide your hadoop.proxyusers.* properties settings? &lt;/P&gt;</description>
      <pubDate>Tue, 21 Jun 2016 00:08:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148059#M110588</guid>
      <dc:creator>rpathak</dc:creator>
      <dc:date>2016-06-21T00:08:01Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148060#M110589</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="5128-hadoop-proxyconfig.png" style="width: 883px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/21344i2A4B18F94A6A1EEF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="5128-hadoop-proxyconfig.png" alt="5128-hadoop-proxyconfig.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10322/rpathak.html" nodeid="10322" target="_blank"&gt;@Rahul Pathak&lt;/A&gt; This is what the current listing of hadoop.proxyuser is&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 13:15:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148060#M110589</guid>
      <dc:creator>colton_rodgers</dc:creator>
      <dc:date>2019-08-18T13:15:26Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148061#M110590</link>
      <description>&lt;P&gt;From what I can tell this ended up being an &lt;STRONG&gt;available resources issue&lt;/STRONG&gt;. I logged back in at midnight when all users had left and everything seems to be working correctly.

Some of the time the Pig Job will say that it failed to start, but in the stderr/stdout it will show the results of the DUMP that I am trying to perform and since it was working fine in the grunt CLI, this was a very tricky problem to uncover.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Jun 2016 19:05:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148061#M110590</guid>
      <dc:creator>colton_rodgers</dc:creator>
      <dc:date>2016-06-21T19:05:51Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148062#M110591</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2161/coltonrodgers.html" nodeid="2161"&gt;@Colton Rodgers&lt;/A&gt; I assume webhcat/templeton not responding fast enough to the call hence the read timeout error.  The grunt shell call does not go through webhcat I believe. Any suggestions from your perspective that we can do better to improving of isolating the issue?   &lt;/P&gt;</description>
      <pubDate>Wed, 22 Jun 2016 23:39:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148062#M110591</guid>
      <dc:creator>rmolina</dc:creator>
      <dc:date>2016-06-22T23:39:56Z</dc:date>
    </item>
    <item>
      <title>Re: Pig View Will Not Run, but Grunt CLI Will</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148063#M110592</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/527/rmolina.html" nodeid="527"&gt;@rmolina&lt;/A&gt; The webhcat response time did end up seeming to be the result of the issue. I do believe that being able to block off a certain amount of memory for WebHCat specifically ( I think this may already be available with &lt;STRONG&gt;templeton.mapper.memory.mb &lt;/STRONG&gt;), but that is just the mapper memory and I haven't looked too much farther into it. When there are no other users using the cluster, the Pig GUI view will run fine, but as that is not going to be the case for most Prod clusters that we deploy, I think that being able to set a reserve specifically in the WebHCat-Env or WebHCat-Site could prove to be useful in making sure the resources are properly allocated.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Jun 2016 00:42:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Pig-View-Will-Not-Run-but-Grunt-CLI-Will/m-p/148063#M110592</guid>
      <dc:creator>colton_rodgers</dc:creator>
      <dc:date>2016-06-23T00:42:42Z</dc:date>
    </item>
  </channel>
</rss>

