<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How new DAGs are submitted to existing Tez Application Master? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163022#M125396</link>
    <description>&lt;P&gt;Yeah, I have to say I didn't look into the Hive code, so I am not sure whether you can actually "find" running Tez applications and attach to them. I think it's just the TezClient being kept open in the Hive server, Pig, or whatever client, which then submits more DAGs to the existing AM. There might be ways for discovery, though. Basically, Tez doesn't take over much of what YARN does. This will be a bit different with LLAP, which is like a big YARN container running multiple Tez tasks. That one will have some workload management, scheduling, etc.&lt;/P&gt;&lt;P&gt;&lt;A href="https://tez.apache.org/releases/0.7.1/tez-api-javadocs/org/apache/tez/client/TezClient.html" target="_blank"&gt;https://tez.apache.org/releases/0.7.1/tez-api-javadocs/org/apache/tez/client/TezClient.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 04 Aug 2016 16:27:43 GMT</pubDate>
    <dc:creator>bleonhardi</dc:creator>
    <dc:date>2016-08-04T16:27:43Z</dc:date>
    <item>
      <title>How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163016#M125390</link>
      <description>&lt;P&gt;As per Tez sessions, DAGs submitted within a session are handled by the same AppMaster. I am unable to understand how a new application (DAG) is mapped to the already running AppMaster. Who does it, and how? As per YARN, the ResourceManager is responsible for launching AppMasters. How is this functionality bypassed by Tez?&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Wed, 03 Aug 2016 19:37:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163016#M125390</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-08-03T19:37:15Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163017#M125391</link>
      <description>&lt;P&gt;Hi Shiva,&lt;/P&gt;&lt;P&gt;It's a Tez client API call that you would need to make to find already existing Application Masters of your user in the cluster. You can then hook up with them. The main user at the moment is Hive, which utilizes it to reduce the startup cost of a query. Essentially, each JDBC connection of a Hive session (if enabled) maps to one application master in YARN. So when you run a query, Hive will check whether an application master already exists (using the Tez client API calls) and use that AM, or create a new one otherwise.&lt;/P&gt;</description>
      <pubDate>Wed, 03 Aug 2016 21:53:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163017#M125391</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-08-03T21:53:56Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163018#M125392</link>
      <description>&lt;P&gt;So the handshake between the client and the AppMaster in YARN (which is decommissioned once the job is done) is continued here in a Tez session. The client submits new DAGs directly to the AppMaster, and the ResourceManager thinks it is still the same application running, so the DAGs run with the same application ID.&lt;/P&gt;&lt;P&gt;Correct me if I am wrong.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 11:58:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163018#M125392</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-08-04T11:58:01Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163019#M125393</link>
      <description>&lt;P&gt;Systems like Tez that are aware of the whole DAG of operations can do better global optimizations than systems like Hadoop MapReduce, which are unaware of the DAG to be executed.&lt;/P&gt;&lt;P&gt;While this is the theory, different systems implement it in different ways, and that is where the "advantages" and "disadvantages" come from. Computations expressed in Hadoop MapReduce boil down to multiple iterations of:&lt;/P&gt;&lt;P&gt;(i) read data from HDFS,&lt;/P&gt;&lt;P&gt;(ii) apply map and reduce,&lt;/P&gt;&lt;P&gt;(iii) write back to HDFS.&lt;/P&gt;&lt;P&gt;Each map-reduce round is completely independent of the others. Hadoop does not have any global knowledge of which MR steps are going to come after each MR. For many iterative algorithms this is inefficient, as the data between each map-reduce pair gets written to and read from the filesystem. Newer systems like Tez improve performance over Hadoop by considering the whole DAG of map-reduce steps and optimizing it globally &lt;STRONG&gt;(e.g., pipelining consecutive map steps into one and not writing intermediate data to HDFS). This prevents writing data back and forth after every reduce.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If this is helpful, closing the question would be appreciated.&lt;/P&gt;</description>
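      <content:encoded><![CDATA[
The point about intermediate data can be made concrete with a deliberately simplified count. This is a toy cost model, not Hadoop or Tez code: the class and method names are hypothetical, and it assumes (as the post does) that every chained MapReduce round ends with one HDFS write while a single DAG writes to HDFS only at its final stage.

```java
import java.util.Arrays;
import java.util.List;

// Toy cost model only: hypothetical names, not Hadoop or Tez code.
final class IntermediateWriteModel {

    /** Chained MapReduce jobs: every round ends by writing its output to HDFS. */
    static int hdfsWritesAsChainedJobs(List<String> rounds) {
        return rounds.size();
    }

    /** One DAG: only the last stage writes to HDFS; earlier outputs flow between stages. */
    static int hdfsWritesAsSingleDag(List<String> stages) {
        return stages.isEmpty() ? 0 : 1;
    }

    public static void main(String[] args) {
        List<String> steps = Arrays.asList("filter", "join", "aggregate");
        System.out.println("chained MR jobs: " + hdfsWritesAsChainedJobs(steps));
        System.out.println("single DAG: " + hdfsWritesAsSingleDag(steps));
    }
}
```

For the three steps in `main`, the model counts three HDFS writes for the chained jobs and one for the single DAG. Real savings depend on data sizes and on how the engine moves intermediate data, which this sketch ignores.
]]></content:encoded>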
      <pubDate>Thu, 04 Aug 2016 13:28:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163019#M125393</guid>
      <dc:creator>shivkumar82015</dc:creator>
      <dc:date>2016-08-04T13:28:47Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163020#M125394</link>
      <description>&lt;P&gt;Thank you &lt;A rel="user" href="https://community.cloudera.com/users/11907/shivkumar82015.html" nodeid="11907"&gt;@Shiv kumar&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 16:18:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163020#M125394</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-08-04T16:18:15Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163021#M125395</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; &lt;/P&gt;&lt;P&gt;As per YARN, the AppMaster is just code. So I am unable to figure out how a new DAG can be submitted to an existing AppMaster that was written to handle some other DAG.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 16:21:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163021#M125395</guid>
      <dc:creator>shivanageshch</dc:creator>
      <dc:date>2016-08-04T16:21:59Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163022#M125396</link>
      <description>&lt;P&gt;Yeah, I have to say I didn't look into the Hive code, so I am not sure whether you can actually "find" running Tez applications and attach to them. I think it's just the TezClient being kept open in the Hive server, Pig, or whatever client, which then submits more DAGs to the existing AM. There might be ways for discovery, though. Basically, Tez doesn't take over much of what YARN does. This will be a bit different with LLAP, which is like a big YARN container running multiple Tez tasks. That one will have some workload management, scheduling, etc.&lt;/P&gt;&lt;P&gt;&lt;A href="https://tez.apache.org/releases/0.7.1/tez-api-javadocs/org/apache/tez/client/TezClient.html" target="_blank"&gt;https://tez.apache.org/releases/0.7.1/tez-api-javadocs/org/apache/tez/client/TezClient.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 16:27:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163022#M125396</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-08-04T16:27:43Z</dc:date>
    </item>
    <item>
      <title>Re: How new DAGs are submitted to existing Tez Application Master?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163023#M125397</link>
      <description>&lt;P&gt;Yeah, see above. I think you just need a client like Hive that opens a TezClient, creates an application master, and then submits more DAGs to it. Specifically, in Hive you have by default one Tez session per JDBC connection. So if you run multiple queries over the same JDBC connection, they use the same Tez client, the same Tez session and, as long as the timeout is not reached, the same application master.&lt;/P&gt;&lt;P&gt;Yes, I think it sounds a bit more magical than it is; the reuse is just the session mode, where the client can send multiple DAGs to the same Tez AM. As I said, in LLAP you will have shared long-running processes that can be discovered, so it's a bit different.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/blog/introducing-tez-sessions/" target="_blank"&gt;http://hortonworks.com/blog/introducing-tez-sessions/&lt;/A&gt;&lt;/P&gt;</description>
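      <content:encoded><![CDATA[
The session reuse described in this reply can be sketched as a small toy model. None of the classes below are the Tez or YARN API: ToyResourceManager, ToyAppMaster and ToySessionClient are hypothetical stand-ins, and the sketch assumes the AM is launched when the session starts and that the AM is a generic DAG runner rather than code written for one particular DAG. In the real client, session mode is the isSession argument of TezClient.create, and DAGs are sent with TezClient.submitDAG.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: hypothetical classes, NOT the Tez or YARN API.

/** Stands in for the YARN ResourceManager: hands out application ids and launches AMs. */
final class ToyResourceManager {
    private int nextAppId = 1;
    private int launchedAms = 0;

    ToyAppMaster launchAppMaster() {
        launchedAms++;
        return new ToyAppMaster("application_" + nextAppId++);
    }

    int launchedAmCount() {
        return launchedAms;
    }
}

/** Stands in for a generic AM that can run any DAG handed to it. */
final class ToyAppMaster {
    final String applicationId;
    final List<String> executedDags = new ArrayList<>();

    ToyAppMaster(String applicationId) {
        this.applicationId = applicationId;
    }

    String run(String dagName) {
        executedDags.add(dagName);
        return applicationId + ":" + dagName;
    }
}

/** Stands in for a client kept open by, for example, one Hive JDBC connection. */
final class ToySessionClient {
    private final ToyResourceManager rm;
    private ToyAppMaster am; // null while no session is running

    ToySessionClient(ToyResourceManager rm) {
        this.rm = rm;
    }

    /** The only step that involves the ResourceManager. */
    void start() {
        am = rm.launchAppMaster();
    }

    /** Later DAGs go straight to the AM that is already running. */
    String submitDag(String dagName) {
        if (am == null) {
            throw new IllegalStateException("session not started");
        }
        return am.run(dagName);
    }

    /** Models closing the session or hitting the idle timeout. */
    void stop() {
        am = null;
    }
}

final class SessionDemo {
    public static void main(String[] args) {
        ToyResourceManager rm = new ToyResourceManager();
        ToySessionClient session = new ToySessionClient(rm);
        session.start();
        System.out.println(session.submitDag("query1")); // application_1:query1
        System.out.println(session.submitDag("query2")); // application_1:query2
        session.stop();
        session.start();
        System.out.println(session.submitDag("query3")); // application_2:query3
        System.out.println("AMs launched: " + rm.launchedAmCount()); // AMs launched: 2
    }
}
```

In this model the two queries in the first session share one application id because the ResourceManager is contacted only in start(). A new id appears only after the session is stopped and started again.
]]></content:encoded>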
      <pubDate>Thu, 04 Aug 2016 16:29:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/How-new-DAGs-are-submitted-to-existing-Tez-Application/m-p/163023#M125397</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-08-04T16:29:59Z</dc:date>
    </item>
  </channel>
</rss>

