<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive Query slowness in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154770#M117211</link>
    <description>&lt;P&gt;Couple of things you can check&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Check if the dataset has changed from previous run to current run.&lt;/LI&gt;&lt;LI&gt;Not sure how you are running your query. For e.g, if you are using hive cli, you can use "hive --hiveconf hive.tez.exec.print.summary=true". This should print the pre-execution (compilation, job submission), DAG execution times after the job is complete. That can give hints on where the time is spent&lt;/LI&gt;&lt;LI&gt;If you have tez-ui, that is the best place to start checking the details on where the time is spent.&lt;/LI&gt;&lt;LI&gt;It would be good to share the query and "explain &amp;lt;sql&amp;gt;" output with "--hiveconf hive.explain.user=false". If possible, share "explain formatted &amp;lt;sql&amp;gt;" output which dumps the plan information in JSON format.&lt;/LI&gt;&lt;LI&gt;Check if vertices are running slow due to resource constraints (i.e, some tasks would have started, but others are in waiting mode as resources are not available in queue or in cluster).&lt;/LI&gt;&lt;/OL&gt;</description>
    <pubDate>Thu, 15 Dec 2016 18:59:15 GMT</pubDate>
    <dc:creator>rbalamohan</dc:creator>
    <dc:date>2016-12-15T18:59:15Z</dc:date>
    <item>
      <title>Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154767#M117208</link>
      <description>&lt;P&gt;The hive query which is used by my batch is taking too much time to run.


Earlier when i fire the same query it took around 5 minutes and now it is taking around 22 minutes.

I cant change the query.

Please suggest the correct way to investigate this issue or kindly suggest any resolution.
&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 16:15:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154767#M117208</guid>
      <dc:creator>Yukti</dc:creator>
      <dc:date>2016-12-15T16:15:32Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154768#M117209</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/12568/yukagr19.html" nodeid="12568"&gt;@Yukti  Agrawal&lt;/A&gt;&lt;P&gt;First check if the query creates an application and if there any issues at the application end from Resource Manager UI.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 16:28:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154768#M117209</guid>
      <dc:creator>ssubhas</dc:creator>
      <dc:date>2016-12-15T16:28:25Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154769#M117210</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/5019/ssubhas.html" nodeid="5019"&gt;@Sindhu&lt;/A&gt; 



Please find the full logs from Resource Manager UI.



&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/10327-rm-ui-logs.txt"&gt;rm-ui-logs.txt&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 15 Dec 2016 18:02:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154769#M117210</guid>
      <dc:creator>Yukti</dc:creator>
      <dc:date>2016-12-15T18:02:46Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154770#M117211</link>
      <description>&lt;P&gt;Couple of things you can check&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Check if the dataset has changed from previous run to current run.&lt;/LI&gt;&lt;LI&gt;Not sure how you are running your query. For e.g, if you are using hive cli, you can use "hive --hiveconf hive.tez.exec.print.summary=true". This should print the pre-execution (compilation, job submission), DAG execution times after the job is complete. That can give hints on where the time is spent&lt;/LI&gt;&lt;LI&gt;If you have tez-ui, that is the best place to start checking the details on where the time is spent.&lt;/LI&gt;&lt;LI&gt;It would be good to share the query and "explain &amp;lt;sql&amp;gt;" output with "--hiveconf hive.explain.user=false". If possible, share "explain formatted &amp;lt;sql&amp;gt;" output which dumps the plan information in JSON format.&lt;/LI&gt;&lt;LI&gt;Check if vertices are running slow due to resource constraints (i.e, some tasks would have started, but others are in waiting mode as resources are not available in queue or in cluster).&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Thu, 15 Dec 2016 18:59:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154770#M117211</guid>
      <dc:creator>rbalamohan</dc:creator>
      <dc:date>2016-12-15T18:59:15Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154771#M117212</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/311/rbalamohan.html" nodeid="311"&gt;@Rajesh Balamohan&lt;/A&gt; 

The thing is nothing is changed in code or in the volume of data.


There is no change made on cluster in terms of configuration or installation of new things.


And since there is no change made than how suddenly the job is getting slow?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Dec 2016 22:30:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154771#M117212</guid>
      <dc:creator>Yukti</dc:creator>
      <dc:date>2016-12-19T22:30:09Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154772#M117213</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/5019/ssubhas.html" nodeid="5019"&gt;@Sindhu&lt;/A&gt; 

Please suggest anything.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Dec 2016 22:33:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154772#M117213</guid>
      <dc:creator>Yukti</dc:creator>
      <dc:date>2016-12-19T22:33:35Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154773#M117214</link>
      <description>&lt;P&gt;Hi Yukti Agrawal,&lt;/P&gt;&lt;P&gt;Based on the logs that you have attached, it seems that,&lt;/P&gt;&lt;P&gt;1. HA is enabled in your cluster&lt;/P&gt;&lt;P&gt;2. One of your NameNode is down&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Reason :&lt;/STRONG&gt; Hive is not able to reach the active NameNode or None of your NameNodes are active at the time of the process.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Suggestion :&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;1. If NN automatic failover is not enabled, try to manually make the standby NN as active.&lt;/P&gt;&lt;P&gt;2. If NN automatic failover is enabled, try to make Hive aware that NN HA is enabled. The NameNode's IP should be replaced by NameNode's Namespace (HA way of reaching NameNode).&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Log Snippet : &lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.ipc.StandbyException): Operation category READ is not supported in state standby&lt;/PRE&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Adharsh&lt;/P&gt;</description>
      <pubDate>Mon, 19 Dec 2016 23:00:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154773#M117214</guid>
      <dc:creator>Senthidataguy</dc:creator>
      <dc:date>2016-12-19T23:00:43Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154774#M117215</link>
      <description>&lt;P&gt;@Yukti Agrawal&lt;/P&gt;&lt;P&gt;There is a chance that your job might be waiting for resources to be released by other jobs running in the cluster. Its worth checking in RM UI once you execute the query until the state changes to "RUNNING" - where most of the time is being spent. &lt;/P&gt;</description>
      <pubDate>Thu, 22 Dec 2016 20:55:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154774#M117215</guid>
      <dc:creator>knarendran</dc:creator>
      <dc:date>2016-12-22T20:55:45Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154775#M117216</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/311/rbalamohan.html"&gt;@Rajesh Balamohan&lt;/A&gt;, I am also facing slowness as days progresses, initially the query was taking 15 mins to complete, which now takes 50 mins to complete. Unlike the problem stated here, my dataset is also growing in size. I am attaching the dumps of formatted explain. Could you please look into it and suggest if anything suspicious found. TIA.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2017 16:34:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/154775#M117216</guid>
      <dc:creator>mr_masud_sarkar</dc:creator>
      <dc:date>2017-08-07T16:34:15Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/324459#M229374</link>
      <description>&lt;P&gt;Hi Masud,&lt;/P&gt;&lt;P&gt;I am also facing the same issue now. What is the resolution find out for hive slowness.&lt;/P&gt;&lt;P&gt;Please let me know that helps me.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Sep 2021 00:36:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/324459#M229374</guid>
      <dc:creator>DE_2021</dc:creator>
      <dc:date>2021-09-16T00:36:12Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query slowness</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/324477#M229379</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/92001"&gt;@DE_2021&lt;/a&gt;,&amp;nbsp;as this is an older post, you would have a better chance of receiving a resolution by&lt;A href="“https://community.cloudera.com/t5/forums/postpage/board-id/Questions”" target="_blank"&gt; starting a new thread&lt;/A&gt;. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Sep 2021 06:11:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/324477#M229379</guid>
      <dc:creator>VidyaSargur</dc:creator>
      <dc:date>2021-09-16T06:11:02Z</dc:date>
    </item>
    <item>
      <title>Re: Hive Query execution issue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/334435#M231762</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;when I run the hive query it showing the below error&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;SPAN&gt;Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;But this error is not showing all the time it got&amp;nbsp;succeed with some of the users some times it got failed. Could you please suggest the reason and how to overcome this.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;need urgent. could you please help us.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2022 10:38:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Query-slowness/m-p/334435#M231762</guid>
      <dc:creator>lav</dc:creator>
      <dc:date>2022-01-24T10:38:01Z</dc:date>
    </item>
  </channel>
</rss>

