<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Set maximum containers on a Hive query in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/331091#M230819</link>
    <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/48553"&gt;@rpathak&lt;/a&gt;&amp;nbsp;- having discussed this further amongst our team we think we are going to try setting up elastic YARN queues to help this situation.&lt;/P&gt;</description>
    <pubDate>Fri, 26 Nov 2021 09:25:34 GMT</pubDate>
    <dc:creator>Andyjmoss</dc:creator>
    <dc:date>2021-11-26T09:25:34Z</dc:date>
    <item>
      <title>Set maximum containers on a Hive query</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/330696#M230728</link>
      <description>&lt;P&gt;I have a hive insert statement which by default will use all available resources in YARN as it is reading a large volume of data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am happy for the query to take longer and use less resources so that other users can also have access to compute resources.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I don't want to set up YARN queues as this is an unusual query and so don't want to permanently restrict the cluster.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I was using Spark can do this quite easily with setting a number of executors. Is there a hive config that allows me to do this at a query level.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have looked at various other posts such as those below, but nothing seems to allow this.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;A href="https://community.cloudera.com/t5/Support-Questions/How-to-control-number-of-containers-in-a-hive-query/td-p/297734" target="_blank" rel="noopener"&gt;https://community.cloudera.com/t5/Support-Questions/How-to-control-number-of-containers-in-a-hive-query/td-p/297734&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/Support-Questions/Is-there-a-way-to-set-minimum-maximum-number-of-containers/m-p/190660#M152749" target="_blank" rel="noopener"&gt;https://community.cloudera.com/t5/Support-Questions/Is-there-a-way-to-set-minimum-maximum-number-of-containers/m-p/190660#M152749&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.cloudera.com/t5/Support-Questions/Can-I-limit-the-number-of-containers-allocated-by-Tez/m-p/157020#M119433" target="_blank" rel="noopener"&gt;https://community.cloudera.com/t5/Support-Questions/Can-I-limit-the-number-of-containers-allocated-by-Tez/m-p/157020#M119433&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also seen this:&amp;nbsp;&lt;A href="https://community.cloudera.com/t5/Support-Questions/How-are-number-of-mappers-determined-for-a-query-with-hive/m-p/94915" target="_blank" rel="noopener"&gt;https://community.cloudera.com/t5/Support-Questions/How-are-number-of-mappers-determined-for-a-query-with-hive/m-p/94915&lt;/A&gt;&amp;nbsp;- but not sure if changing split sizes is a good idea. Would this then impact the structure of data stored by my data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Grateful for any suggestions.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Nov 2021 20:35:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/330696#M230728</guid>
      <dc:creator>Andyjmoss</dc:creator>
      <dc:date>2021-11-23T20:35:02Z</dc:date>
    </item>
    <item>
      <title>Re: Set maximum containers on a Hive query</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/330879#M230778</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/93737"&gt;@Andyjmoss&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As you already pointed&amp;nbsp;&lt;A href="https://community.cloudera.com/t5/Support-Questions/How-are-number-of-mappers-determined-for-a-query-with-hive/m-p/94915" target="_blank" rel="noopener"&gt;https://community.cloudera.com/t5/Support-Questions/How-are-number-of-mappers-determined-for-a-query...&lt;/A&gt;&lt;/P&gt;&lt;P&gt;There is no limit per query, you can only adjust max and min grouping size to play around on mapper tasks.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Would this then impact the structure of data stored by my data?&lt;/LI-CODE&gt;&lt;P&gt;No this only affects how much data each map task will get.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Nov 2021 20:53:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/330879#M230778</guid>
      <dc:creator>rpathak</dc:creator>
      <dc:date>2021-11-23T20:53:18Z</dc:date>
    </item>
    <item>
      <title>Re: Set maximum containers on a Hive query</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/331091#M230819</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/48553"&gt;@rpathak&lt;/a&gt;&amp;nbsp;- having discussed this further amongst our team we think we are going to try setting up elastic YARN queues to help this situation.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Nov 2021 09:25:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Set-maximum-containers-on-a-Hive-query/m-p/331091#M230819</guid>
      <dc:creator>Andyjmoss</dc:creator>
      <dc:date>2021-11-26T09:25:34Z</dc:date>
    </item>
  </channel>
</rss>

