<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51286#M54948</link>
    <description>Sorry, wrong setting. I meant:&lt;BR /&gt;&lt;BR /&gt;yarn.nodemanager.log-aggregation.roll-monitoring-interval-seconds</description>
    <pubDate>Tue, 21 Feb 2017 22:02:42 GMT</pubDate>
    <dc:creator>mbigelow</dc:creator>
    <dc:date>2017-02-21T22:02:42Z</dc:date>
    <item>
      <title>Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51200#M54937</link>
      <description>&lt;P&gt;Hi, we need a way to maintain and search logs for long-running Spark Streaming jobs on YARN. Log aggregation is disabled in our cluster. We are considering Solr/Elasticsearch, and maybe Flume or Kafka, to read the Spark job logs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any suggestions on how to implement search on these logs and manage them easily?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Suri&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 11:07:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51200#M54937</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2022-09-16T11:07:15Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51216#M54938</link>
      <description>I am assuming that log aggregation was turned off because it doesn't trigger until a job completes, which is useless for long-running/streaming jobs. I recommend turning it back on and using yarn.log-aggregation.retain-check-interval-seconds to have the logs collected on a regular basis.&lt;BR /&gt;&lt;BR /&gt;Solr/ES is really good for the counters/metrics and could be used for the logs as well.</description>
      <pubDate>Tue, 21 Feb 2017 08:26:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51216#M54938</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T08:26:23Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51231#M54939</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/18127"&gt;@mbigelow&lt;/a&gt;&amp;nbsp;You are right. We turned it off because of the long-running jobs.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Do you know any other ways to implement log search besides Solr/Elasticsearch?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Suri&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 15:26:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51231#M54939</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T15:26:56Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51235#M54940</link>
      <description>What are you trying to achieve exactly? Do you just want to be able to search through the logs for key phrases? Do you want to allow basic users to search the raw logs? Are you trying to hunt down problematic jobs?</description>
      <pubDate>Tue, 21 Feb 2017 16:13:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51235#M54940</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T16:13:22Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51240#M54941</link>
      <description>&lt;P&gt;We want to search for key phrases, and at the same time we want developers to be able to look into the raw logs for troubleshooting. We also want alerts for specific errors.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 16:33:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51240#M54941</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T16:33:17Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51255#M54942</link>
      <description>You could do this in many ways. You could just load it into Solr/ES and go to town. Hive would not be a great fit, but I could see some tables being built around specific data like job counters or metrics. MR jobs could be built to pull out specific data (possibly to load into a Hive table), or Spark jobs could do the same (and the Spark shell can be used to explore the raw data). Simple tools like grep, awk, etc. can also be used, since the individual logs, once aggregated, are available to the user.&lt;BR /&gt;&lt;BR /&gt;If you have CM, the YARN application screen for a cluster is, I'm pretty sure, built using an embedded Solr and gives you an idea of what could be done. This is more around metrics and job counters again.</description>
      <pubDate>Tue, 21 Feb 2017 19:32:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51255#M54942</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T19:32:51Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51272#M54943</link>
      <description>&lt;P&gt;The documentation for YARN log aggregation says that logs are aggregated after an application completes.&lt;/P&gt;&lt;P&gt;Streaming jobs run for a much longer duration and potentially never terminate. I want to get the logs for my streaming jobs into HDFS before the application completes or terminates. What are better ways to do this, since log aggregation only happens after the job completes?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Suri&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 21:15:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51272#M54943</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T21:15:18Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51274#M54944</link>
      <description>This got lost in my earlier reply...&lt;BR /&gt;&lt;BR /&gt;yarn.log-aggregation.retain-check-interval-seconds&lt;BR /&gt;&lt;BR /&gt;This determines when it checks whether logs need to be aggregated. By default it is 0, which means it doesn't check and a job must finish. Setting it will allow it to collect the logs for jobs that, in theory, won't end.</description>
      <pubDate>Tue, 21 Feb 2017 21:21:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51274#M54944</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T21:21:47Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51281#M54945</link>
      <description>&lt;P&gt;Thanks,&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/18127"&gt;@mbigelow&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, if I set&amp;nbsp;&lt;SPAN&gt;yarn.log-aggregation.retain-check-interval-seconds to 60 seconds, it will send the logs to HDFS (every 60 seconds) even when the job has not finished? (Since streaming jobs run forever.)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Suri&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 21:46:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51281#M54945</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T21:46:04Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51283#M54946</link>
      <description>Yes, but that is pretty frequent, so I don't know how it will go. I'd be interested to know.</description>
      <pubDate>Tue, 21 Feb 2017 21:50:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51283#M54946</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T21:50:05Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51285#M54947</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/18127"&gt;@mbigelow&lt;/a&gt;&amp;nbsp;but some other sources say&lt;SPAN&gt;&amp;nbsp;"set the &lt;/SPAN&gt;&lt;SPAN&gt;yarn.log-aggregation.retain-check-interval-seconds to&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;specify how often the log retention check should be run. By default, it is&amp;nbsp;one-tenth of the log retention time" - What I understood from this is that it only checks for retention and may not aggregate the logs on that interval. Did I understand it correctly?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Suri&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Feb 2017 21:53:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51285#M54947</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T21:53:22Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51286#M54948</link>
      <description>Sorry, wrong setting. I meant:&lt;BR /&gt;&lt;BR /&gt;yarn.nodemanager.log-aggregation.roll-monitoring-interval-seconds</description>
      <pubDate>Tue, 21 Feb 2017 22:02:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51286#M54948</guid>
      <dc:creator>mbigelow</dc:creator>
      <dc:date>2017-02-21T22:02:42Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51288#M54949</link>
      <description>Thank you, I will try it out.</description>
      <pubDate>Tue, 21 Feb 2017 22:17:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/51288#M54949</guid>
      <dc:creator>SuriNuthalapati</dc:creator>
      <dc:date>2017-02-21T22:17:15Z</dc:date>
    </item>
    <item>
      <title>Re: Log management for Long-running Spark Streaming Jobs on YARN Cluster</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/53687#M54951</link>
      <description>&lt;P&gt;It's true that you can aggregate logs to HDFS while the job is still running; however, the minimum log upload interval (yarn.nodemanager.log-aggregation.roll-monitoring-interval-seconds) you can set is 3600 seconds, which is 1 hour. The design is meant to protect the NameNode from being spammed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you need logs more often than that, you may have to use an external service to do the log &lt;SPAN&gt;aggregation. Either write your own or find other tools.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Below is the proof from yarn-default.xml in the hadoop-common source code (cdh5-2.6.0_5.7.1).&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;SPAN&gt;&amp;lt;property&amp;gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;  &amp;lt;description&amp;gt;&lt;/SPAN&gt;Defines how often NMs wake up to upload log files.&lt;BR /&gt;  The default value is -1. By default, the logs will be uploaded when&lt;BR /&gt;  the application is finished. By setting this configure, logs can be uploaded&lt;BR /&gt;  periodically when the application is running. The minimum rolling-interval-seconds&lt;BR /&gt;  can be set is 3600.&lt;BR /&gt;  &lt;SPAN&gt;&amp;lt;/description&amp;gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;  &amp;lt;name&amp;gt;&lt;/SPAN&gt;yarn.nodemanager.log-aggregation.roll-monitoring-interval-seconds&lt;SPAN&gt;&amp;lt;/name&amp;gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;  &amp;lt;value&amp;gt;&lt;/SPAN&gt;-1&lt;SPAN&gt;&amp;lt;/value&amp;gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&amp;lt;/property&amp;gt;&lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
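      <content:encoded>&lt;P&gt;For anyone who wants to try the rolling upload discussed in this thread, here is a minimal yarn-site.xml sketch. It assumes log aggregation is switched back on (it was disabled on the original poster's cluster) and that the 3600-second minimum from the yarn-default.xml description applies to your Hadoop version. The values are illustrative and untested on any particular cluster.&lt;/P&gt;&lt;PRE&gt;&amp;lt;!-- Assumption: re-enable YARN log aggregation (the default is false) --&amp;gt;&lt;BR /&gt;&amp;lt;property&amp;gt;&lt;BR /&gt;  &amp;lt;name&amp;gt;yarn.log-aggregation-enable&amp;lt;/name&amp;gt;&lt;BR /&gt;  &amp;lt;value&amp;gt;true&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;BR /&gt;&lt;BR /&gt;&amp;lt;!-- Upload logs of running applications once per hour; 3600 is the documented minimum --&amp;gt;&lt;BR /&gt;&amp;lt;property&amp;gt;&lt;BR /&gt;  &amp;lt;name&amp;gt;yarn.nodemanager.log-aggregation.roll-monitoring-interval-seconds&amp;lt;/name&amp;gt;&lt;BR /&gt;  &amp;lt;value&amp;gt;3600&amp;lt;/value&amp;gt;&lt;BR /&gt;&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;With the default of -1, logs are uploaded only after the application finishes, so a streaming job that never ends would never have its logs collected.&lt;/P&gt;</content:encoded>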
      <pubDate>Fri, 14 Apr 2017 17:38:30 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Log-managmement-for-Long-running-Spark-Streaming-Jobs-on/m-p/53687#M54951</guid>
      <dc:creator>shuffle</dc:creator>
      <dc:date>2017-04-14T17:38:30Z</dc:date>
    </item>
  </channel>
</rss>

