<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Run Oryx on a machine that is not part of the cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29164#M4227</link>
    <description>&lt;P&gt;Just to check, you have this commit right?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/cloudera/oryx/commit/4b5e557a36f3d666bab0befc21b79efdf1fcd52d" target="_blank"&gt;https://github.com/cloudera/oryx/commit/4b5e557a36f3d666bab0befc21b79efdf1fcd52d&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The symptom here is that the App Master for the MR job dies straight away, and can't be contacted. The important thing is to know why. For example when I looked at the AM app screen (i.e.&amp;nbsp;http://[host]:8088/cluster/app/application_1435553713675_0018) I saw something like ...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;Application application_1435553713675_0018 failed 2 times due to AM Container for appattempt_1435553713675_0018_000002 exited with exitCode: -104&lt;/DIV&gt;&lt;DIV&gt;For more detailed output, check application tracking page:http://&lt;SPAN&gt;[host]&lt;/SPAN&gt;:8088/proxy/application_1435553713675_0018/Then, click on links to logs of each attempt.&lt;/DIV&gt;&lt;DIV&gt;Diagnostics: Container [pid=13840,containerID=container_1435553713675_0018_02_000001] is running beyond physical memory limits. Current usage: 421.5 MB of 384 MB physical memory used; 2.7 GB of 806.4 MB virtual memory used. Killing container.&lt;/DIV&gt;&lt;DIV&gt;...&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Do you see anything like that that says why the AM stopped?&lt;/DIV&gt;</description>
    <pubDate>Wed, 01 Jul 2015 16:34:40 GMT</pubDate>
    <dc:creator>srowen</dc:creator>
    <dc:date>2015-07-01T16:34:40Z</dc:date>
    <item>
      <title>Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/22985#M4215</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to run Oryx on a machine that is not part of the cluster...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My setting for the oryx.conf is as below (about the Hadoop/HDFS settings)... Is that a right setting ?&lt;/P&gt;&lt;P&gt;Is there something else I need to set for the oryx.conf&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV class="line"&gt;model=${als-model}&lt;/DIV&gt;&lt;DIV class="line"&gt;model.instance-dir=hdfs://name_node:8020/oryx_data&lt;/DIV&gt;&lt;DIV class="line"&gt;model.local-computation=false&lt;/DIV&gt;&lt;DIV class="line"&gt;model.local-data=false&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="line"&gt;Thanks.&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="line"&gt;&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Fri, 16 Sep 2022 15:39:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/22985#M4215</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2022-09-16T15:39:34Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/23007#M4216</link>
      <description>&lt;P&gt;That's fine. The machine needs to be able to communicate with the cluster of course. Usually you would make the Hadoop configuration visible as well and point to it with HADOOP_CONF_DIR. I think that will be required to get MapReduce to work.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Dec 2014 08:47:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/23007#M4216</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2014-12-23T08:47:11Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/26905#M4217</link>
      <description>&lt;P&gt;Sean,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;A follow up question:&lt;/P&gt;&lt;P&gt;I want to know when Oryx will update the information obtained from the Hadoop configure files.&lt;/P&gt;&lt;P&gt;I mean, when Oryx computation and serving layers start, the Hadoop config files are read.&lt;/P&gt;&lt;P&gt;Then, if there are changes for Hadoop configure files, should I restart Oryx&amp;nbsp;computation and serving&amp;nbsp;layers in order to get updated config files ?&lt;/P&gt;&lt;P&gt;In other words, when Oryx computation and serving layers read Hadoop configuration files ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 01 May 2015 05:58:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/26905#M4217</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-05-01T05:58:13Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/26906#M4218</link>
      <description>&lt;P&gt;Yes, it reads them at startup, so you would need to restart the processes.&lt;/P&gt;</description>
      <pubDate>Fri, 01 May 2015 07:33:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/26906#M4218</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-05-01T07:33:25Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29035#M4219</link>
      <description>&lt;P&gt;Sean,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have some follow-up questions regarding this topic ("Run Oryx on a machine that is not part of the cluster")..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We started to test the case that Oryx 1.0 computation/serving layers running on VMs that are in different virtual LAN from the Hadoop&lt;/P&gt;&lt;P&gt;Cluster. There are firewall port issues for the communication between the two virtual LANs.&lt;/P&gt;&lt;P&gt;Therefore, we opened the all the Hadoop used ports on the Hadoop Cluster virtual LAN, so that the Oryx VMs can talk to it.&lt;/P&gt;&lt;P&gt;We got the "Hadoop used port list" from both the Hadoop configuration files and also some online Cloudera CDH port info.&lt;/P&gt;&lt;P&gt;After doing that, yes, Oryx is able to submit jobs to Hadoop cluster at some level. However, it still drops some communications issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example, from the Oryx log, I see something like this&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Retrying connect to server: server-name/10.190.36.113:40651. Already tried 0 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Retrying connect to server: server-name/10.190.36.113:40651. Already tried 1 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;....&lt;/P&gt;&lt;P&gt;Retrying connect to server: server-name/10.190.36.114:40915. Already tried 0 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Retrying connect to server: server-name/10.190.36.114:40915. Already tried 1 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I dig into the codes and I do not understand why this could happen. My questions...&lt;/P&gt;&lt;P&gt;(1) Is the communication between Oryx to Hadoop is bidirectional OR unidirectional?&lt;/P&gt;&lt;P&gt;My understanding is that Oryx uses the Hadoop configuration files to get the idea where (server and port) it should submit the jobs.&lt;/P&gt;&lt;P&gt;After Oryx submits the job, how Oryx knows the job is completed? Does Oryx check with Hadoop to get the status? Or, Hadoop&amp;nbsp;&lt;/P&gt;&lt;P&gt;communicates back to Oryx VM regarding the status ?&lt;/P&gt;&lt;P&gt;(2) Related to (1) and the log info I post above: Are there "dynamic" ports are used during the Oryx-Hadoop communications? From the log&lt;/P&gt;&lt;P&gt;message, I see ports 40651 and 40915.. They seem to not standard Hadoop ports and even these port numbers are dynamically changing.&lt;/P&gt;&lt;P&gt;This is confusing.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 28 Jun 2015 18:23:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29035#M4219</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-06-28T18:23:55Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29036#M4220</link>
      <description>&lt;P&gt;It doesn't do any communciation of its own; this is all traffic to/from the Hadoop cluster for HDFS and YARN. Hadoop has no idea about the oryx process. It should be dead simple in this regard.&amp;nbsp;I don't think those are well-known ports so maybe this is it trying to talk to the YARN app that runs the MapReduce? what is failing at that point?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would expect the serving to be more predictable as it only needs to talk to HDFS and those daemons should be on well known ports. In any event it's "just" standard Hadoop mechanisms here, which may mean you can ask support for assistance about how to&amp;nbsp;constrain the ports that are used? but in general the computation layer needs to be close to the cluster and is intended to be inside its firewall.&lt;/P&gt;</description>
      <pubDate>Sun, 28 Jun 2015 18:33:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29036#M4220</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-06-28T18:33:22Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29047#M4221</link>
      <description>&lt;P&gt;It drops the connection issue in RowStep..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;One example of the detailed log is as below (I slightly modified the info to hide some sensitive server info, but it keeps main messages):&lt;/P&gt;&lt;P&gt;One thing odd is that even it reports cannot reaching the server:port (say 10.190.36.114:40915) as below, it's still&lt;/P&gt;&lt;P&gt;eventually completing the job. I am thinking maybe it completes with other nodes in a "standard" port?&lt;/P&gt;&lt;P&gt;However, it's still not good a sign seeing cannot connect to server, because it introduces unnecessary running time.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;/// Logs ////&lt;/P&gt;&lt;P&gt;Thu May 28 07:27:57 PDT 2015 INFO Running job "Oryx-/user/xyz/int/def-1-122-Y-RowStep: Avro(hdfs://server105:8020/u... ID=1 (1/1)"&lt;BR /&gt;Thu May 28 07:27:57 PDT 2015 INFO Job status available at: http://server105:8088/proxy/application_1432750221048_0525/&lt;BR /&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 0 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 1 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 2 time(s); maxRetries=3&lt;BR /&gt;...&lt;BR /&gt;Thu May 28 07:34:15 PDT 2015 INFO Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server&lt;BR /&gt;Thu May 28 07:34:16 PDT 2015 INFO Finished Oryx-/user/xyz/int/def-1-122-Y-RowStep&lt;BR /&gt;Thu May 28 07:34:16 PDT 2015 INFO Completed RowStep in 379s&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jun 2015 06:03:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29047#M4221</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-06-29T06:03:47Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29049#M4222</link>
      <description>&lt;P&gt;I see the same thing now. I bet that if you click through to the failed container you see an error like&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Diagnostics: Container [pid=13840,containerID=container_1435553713675_0018_02_000001] is running beyond physical memory limits. Current usage: 421.5 MB of 384 MB physical memory used; 2.7 GB of 806.4 MB virtual memory used. Killing container.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;If so then at least we have the cause. I see what is failing but not yet why as there's not a good reason the AM would only be allowed 384MB. It's a YARN config thing somewhere.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jun 2015 06:29:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29049#M4222</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-06-29T06:29:40Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29059#M4223</link>
      <description>&lt;P&gt;This is the problem; fix coming momentarily:&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/cloudera/oryx/issues/114" target="_blank"&gt;https://github.com/cloudera/oryx/issues/114&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I never saw a Snappy issue. I'm on CDH 5.4.2. Right now it seems to be running OK after the above.&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jun 2015 07:18:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29059#M4223</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-06-29T07:18:03Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29062#M4224</link>
      <description>&lt;P&gt;Interesting..&lt;/P&gt;&lt;P&gt;Is that actually the source of the problem ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I checked my log and there are no container errors info.&lt;/P&gt;&lt;P&gt;As I mentioned previously, the job did complete, but it complains cannot reach some servers during the process.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jun 2015 07:37:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29062#M4224</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-06-29T07:37:14Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29063#M4225</link>
      <description>&lt;P&gt;It's pretty likely. It would not be in the logs but in the error shown on the attempt's (dead) container's info screen in the history server. At least, I saw the same thing exactly and this resolved it, and I can sort of see why this is&amp;nbsp;&lt;EM&gt;now&lt;/EM&gt; a problem in Java 7.&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jun 2015 07:41:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29063#M4225</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-06-29T07:41:04Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29161#M4226</link>
      <description>&lt;P&gt;Sean,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I applied your changes to our code base and still seeing the similar error (as below).&lt;/P&gt;&lt;P&gt;I checked the job by using the job tracking URL (e.g., http://server105:8088/proxy/application_1432750221048_0525/)&lt;/P&gt;&lt;P&gt;and actually there is no failed attempt.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;/// Logs ////&lt;/P&gt;&lt;P&gt;Thu May 28 07:27:57 PDT 2015 INFO Running job "Oryx-/user/xyz/int/def-1-122-Y-RowStep: Avro(hdfs://server105:8020/u... ID=1 (1/1)"&lt;BR /&gt;Thu May 28 07:27:57 PDT 2015 INFO Job status available at: http://server105:8088/proxy/application_1432750221048_0525/&lt;BR /&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 0 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 1 time(s); maxRetries=3&lt;/P&gt;&lt;P&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 2 time(s); maxRetries=3&lt;BR /&gt;...&lt;BR /&gt;Thu May 28 07:34:15 PDT 2015 INFO Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server&lt;BR /&gt;Thu May 28 07:34:16 PDT 2015 INFO Finished Oryx-/user/xyz/int/def-1-122-Y-RowStep&lt;BR /&gt;Thu May 28 07:34:16 PDT 2015 INFO Completed RowStep in 379s&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jul 2015 15:34:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29161#M4226</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-07-01T15:34:55Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29164#M4227</link>
      <description>&lt;P&gt;Just to check, you have this commit right?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://github.com/cloudera/oryx/commit/4b5e557a36f3d666bab0befc21b79efdf1fcd52d" target="_blank"&gt;https://github.com/cloudera/oryx/commit/4b5e557a36f3d666bab0befc21b79efdf1fcd52d&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The symptom here is that the App Master for the MR job dies straight away, and can't be contacted. The important thing is to know why. For example when I looked at the AM app screen (i.e.&amp;nbsp;http://[host]:8088/cluster/app/application_1435553713675_0018) I saw something like ...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;Application application_1435553713675_0018 failed 2 times due to AM Container for appattempt_1435553713675_0018_000002 exited with exitCode: -104&lt;/DIV&gt;&lt;DIV&gt;For more detailed output, check application tracking page:http://&lt;SPAN&gt;[host]&lt;/SPAN&gt;:8088/proxy/application_1435553713675_0018/Then, click on links to logs of each attempt.&lt;/DIV&gt;&lt;DIV&gt;Diagnostics: Container [pid=13840,containerID=container_1435553713675_0018_02_000001] is running beyond physical memory limits. Current usage: 421.5 MB of 384 MB physical memory used; 2.7 GB of 806.4 MB virtual memory used. Killing container.&lt;/DIV&gt;&lt;DIV&gt;...&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Do you see anything like that that says why the AM stopped?&lt;/DIV&gt;</description>
      <pubDate>Wed, 01 Jul 2015 16:34:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29164#M4227</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-07-01T16:34:40Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29188#M4228</link>
      <description>&lt;P&gt;&lt;BR /&gt;Yes, I applied your commit...&lt;BR /&gt;&lt;BR /&gt;I went to an example&lt;BR /&gt;http://[host]:8088/cluster/app/application_1435263631757_19721&lt;BR /&gt;But, I still not seeing the error.&lt;BR /&gt;&lt;BR /&gt;As I mentioned, the job/task is not really got killed or stopped. It just dropped some retrying info (as below), but it continues&lt;BR /&gt;&lt;BR /&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 0 time(s); maxRetries=3&lt;BR /&gt;Thu May 28 07:29:14 PDT 2015 INFO Retrying connect to server: server104/10.190.36.114:40915. Already tried 1 time(s); maxRetries=3&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jul 2015 00:37:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29188#M4228</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-07-02T00:37:48Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29196#M4229</link>
      <description>&lt;P&gt;Yes but the question is why. This is just a message from the driver program saying the master can't be found. The question is what happened to the Application Master. If you find it in YARN, can you see what happened to that container? it almost surely failed to start but why?&lt;/P&gt;</description>
      <pubDate>Thu, 02 Jul 2015 07:14:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29196#M4229</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-07-02T07:14:47Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29259#M4230</link>
      <description>&lt;P&gt;Sean,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am not sure why.&lt;/P&gt;&lt;P&gt;But, it seems relating to firewall.&lt;/P&gt;&lt;P&gt;Our Oryx server is running in a virtiual Lan to talk to another virtual Lan firewall-ed.&lt;/P&gt;&lt;P&gt;It looks the dynamic port is because of ephemeral port and a bug&lt;/P&gt;&lt;P&gt;&lt;A href="https://issues.apache.org/jira/browse/MAPREDUCE-6338" target="_blank"&gt;https://issues.apache.org/jira/browse/MAPREDUCE-6338&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Still&amp;nbsp;digging this issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 06 Jul 2015 05:19:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29259#M4230</guid>
      <dc:creator>Jason.Chen</dc:creator>
      <dc:date>2015-07-06T05:19:23Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29266#M4231</link>
      <description>&lt;P&gt;Yes that could also be a cause. Is it possible to run the process inside the firewall? certainly the MapReduce jobs are intended to be managed by the Computation Layer from within the cluster.&lt;/P&gt;</description>
      <pubDate>Mon, 06 Jul 2015 07:51:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/29266#M4231</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-07-06T07:51:25Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31020#M4232</link>
      <description>&lt;P&gt;you had talk about many issues above, but I find it more related to oryx 1 and MR2.&lt;/P&gt;&lt;P&gt;I wonder whether it possible to run oryx2 outside a CDH cluster?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I deployed a hadoop2.6.0-CDH-5.4.4 cluster with zookeeper, kafka , spark on yarn and hdfs.&lt;/P&gt;&lt;P&gt;After I tried to run oryx2 on my laptop outside the cluster above(the same CDH version deployed but not running ),&lt;/P&gt;&lt;P&gt;&amp;nbsp;batch layer didn't print out as expected:&lt;/P&gt;&lt;P&gt;2015-08-20 23:45:39,278 INFO&amp;nbsp; BatchLayer:82 Creating message stream from topic&lt;BR /&gt;2015-08-20 23:45:39,531 INFO&amp;nbsp; AbstractSparkLayer:224 Initial offsets: {[OryxInput,0]=21642186}&lt;BR /&gt;2015-08-20 23:45:39,610 INFO&amp;nbsp; BatchLayer:117 Starting Spark Streaming&lt;BR /&gt;2015-08-20 23:45:39,677 INFO&amp;nbsp; BatchLayer:124 Spark Streaming is running&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and it printed out exception at last :&lt;/P&gt;&lt;P&gt;Exception in thread "main" java.net.ConnectException: Call From m4040/192.168.88.46 to 0.0.0.0:8032 failed on connection exception: java.net.ConnectException: Connection refused; For more details see:&amp;nbsp; &lt;A href="http://wiki.apache.org/hadoop/ConnectionRefused" target="_blank"&gt;http://wiki.apache.org/hadoop/ConnectionRefused&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;On batch and speed web page, it showed like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;IMG src="https://community.cloudera.com/t5/image/serverpage/image-id/1098iD00E5033C9F3D81E/image-size/original?v=mpbl-1&amp;amp;px=-1" border="0" alt="batch.png" title="batch.png" /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I guess my laptop could not communicate with kafka on cluster and this oryx job was rejected by yarn ?!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 21 Aug 2015 02:54:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31020#M4232</guid>
      <dc:creator>horatio</dc:creator>
      <dc:date>2015-08-21T02:54:02Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31029#M4233</link>
      <description>&lt;P&gt;You can run the binaries on any machine that can see the Hadoop configuration on the classpath, and which can access all of the services it needs to in the cluster. There are a number of services to talk to: HDFS, YARN, Kafka, Spark and the app's executors. So in general you'd have to have a lot of ports open, and at that point your machine is effectively a gateway node in the cluster. Certainly it's meant to be run within the cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The serving layer only needs access to Kafka, and that's by design, so it might more easily run outside the cluster.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Aug 2015 08:58:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31029#M4233</guid>
      <dc:creator>srowen</dc:creator>
      <dc:date>2015-08-21T08:58:27Z</dc:date>
    </item>
    <item>
      <title>Re: Run Oryx on a machine that is not part of the cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31071#M4234</link>
      <description>&lt;P&gt;Sean,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I tried to run Oryx in a node that in the same LAN as the Hadoop cluster.&lt;/P&gt;&lt;P&gt;We tested Oryx 1 fine without problems (we used to have firewall issue. After moving node to the same LAN as Hadoop cluster,&lt;/P&gt;&lt;P&gt;it runs fine)....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We just start to test Oryx 2, using the same network (that's, no firewall issues).&lt;/P&gt;&lt;P&gt;I do have the /etc/hafoop/config in the node I am running Oryx 2.&lt;/P&gt;&lt;P&gt;However; I got the&amp;nbsp;following errors when starting Oryx 2 batch layer..&lt;/P&gt;&lt;P&gt;It looks it's looking for cloudera CDH jar files... Any thought? I need to&amp;nbsp;copy the jar files over ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;errors:&lt;/P&gt;&lt;P&gt;ls: cannot access /opt/cloudera/parcels/CDH/jars/zookeeper-*.jar: No such file or directory&lt;BR /&gt;ls: cannot access /opt/cloudera/parcels/CDH/jars/spark-assembly-*.jar: No such file or directory&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;Jason&lt;/P&gt;</description>
      <pubDate>Sun, 23 Aug 2015 02:19:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Run-Oryx-on-a-machine-that-is-not-part-of-the-cluster/m-p/31071#M4234</guid>
      <dc:creator>JasonChen</dc:creator>
      <dc:date>2015-08-23T02:19:16Z</dc:date>
    </item>
  </channel>
</rss>

