<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Impala on yarn in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-on-yarn/m-p/40313#M26535</link>
    <description>I don't have a ton of experience with Llama, but I think the&lt;BR /&gt;misunderstanding here is that Impala manages the execution of its own&lt;BR /&gt;queries, and the MapReduce framework manages the execution of Hive queries.&lt;BR /&gt;YARN manages resources for individual MapReduce jobs, and it can manage the&lt;BR /&gt;Impala daemons via Llama. The YARN application for Llama will run as long&lt;BR /&gt;as Impala does - that's by design to keep the latency of Impala queries&lt;BR /&gt;very low. In the case of Hive, YARN will manage the job's resources only&lt;BR /&gt;until that job (a single query) is finished.&lt;BR /&gt;&lt;BR /&gt;Not sure why your Hive queries would not be running. If this is in the&lt;BR /&gt;QuickStart VM, my first guess would be that if Llama is still running and&lt;BR /&gt;there aren't enough executors / slots for your Hive queries. YARN in the&lt;BR /&gt;QuickStart VM is not going to be configured with a lot of capacity and it's&lt;BR /&gt;not tested with Llama.&lt;BR /&gt;&lt;BR /&gt;I know of no other way to manage Impala resources via YARN, though.&lt;BR /&gt;</description>
    <pubDate>Fri, 29 Apr 2016 14:04:08 GMT</pubDate>
    <dc:creator>Sean</dc:creator>
    <dc:date>2016-04-29T14:04:08Z</dc:date>
    <item>
      <title>Impala on yarn</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-on-yarn/m-p/40307#M26534</link>
      <description>&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;DIV class="vote"&gt;&lt;SPAN class="vote-count-post "&gt;0&lt;/SPAN&gt;&lt;A title="This question does not show any research effort; it is unclear or not useful" target="_blank"&gt;down vote&lt;/A&gt;&lt;A title="Click to mark as favorite question (click again to undo)" href="http://stackoverflow.com/questions/36931014/running-impala-queries-on-yarn" target="_blank"&gt;favorite&lt;/A&gt;&lt;DIV class="favoritecount"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/TD&gt;&lt;TD&gt;&lt;DIV&gt;&lt;DIV class="post-text"&gt;&lt;P&gt;I am running a set of queries in Hive and Impala in cloudera cluster. As we know cloudera runs hive queries on yarn but not the impala queries. I want to run the impala queries on yarn. I tried it with impala Llama but what happened is when i set the cluster for Llama, the queries were running but while looking at cloudera manager under yarn application its showing running until I didn't killed it, also after doing all these settings my Hive queries are not running, they are all getting failed. Can anyone please tell me how can I do it, Is there any other way to run the impala query on yarn?&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:16:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-on-yarn/m-p/40307#M26534</guid>
      <dc:creator>apple123</dc:creator>
      <dc:date>2022-09-16T10:16:05Z</dc:date>
    </item>
    <item>
      <title>Re: Impala on yarn</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-on-yarn/m-p/40313#M26535</link>
      <description>I don't have a ton of experience with Llama, but I think the&lt;BR /&gt;misunderstanding here is that Impala manages the execution of its own&lt;BR /&gt;queries, and the MapReduce framework manages the execution of Hive queries.&lt;BR /&gt;YARN manages resources for individual MapReduce jobs, and it can manage the&lt;BR /&gt;Impala daemons via Llama. The YARN application for Llama will run as long&lt;BR /&gt;as Impala does - that's by design to keep the latency of Impala queries&lt;BR /&gt;very low. In the case of Hive, YARN will manage the job's resources only&lt;BR /&gt;until that job (a single query) is finished.&lt;BR /&gt;&lt;BR /&gt;Not sure why your Hive queries would not be running. If this is in the&lt;BR /&gt;QuickStart VM, my first guess would be that if Llama is still running and&lt;BR /&gt;there aren't enough executors / slots for your Hive queries. YARN in the&lt;BR /&gt;QuickStart VM is not going to be configured with a lot of capacity and it's&lt;BR /&gt;not tested with Llama.&lt;BR /&gt;&lt;BR /&gt;I know of no other way to manage Impala resources via YARN, though.&lt;BR /&gt;</description>
      <pubDate>Fri, 29 Apr 2016 14:04:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Impala-on-yarn/m-p/40313#M26535</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2016-04-29T14:04:08Z</dc:date>
    </item>
  </channel>
</rss>

