<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Slow distcp job termination in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414195#M255203</link>
    <description>&lt;P&gt;And checking RM there is a huge number of call to the AM , one each 10 seconds:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;2026-06-05 11:09:39,050 INFO org.apache.hadoop.yarn.server.resourcemanager.RMAuditLogger: USER=arcgis   OPERATION=AM Released Container TARGET=SchedulerApp     RESULT=SUCCESS  APPID=application_1780210407885_1976    CONTAINERID=container_e22_1780210407885_1976_01_000014  RESOURCE=&amp;lt;memory:2048, vCores:1&amp;gt;        QUEUENAME=arcgis
2026-06-05 11:09:40,102 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
2026-06-05 11:09:50,116 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
.....
2026-06-05 11:17:41,013 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: Updating application attempt appattempt_1780210407885_1976_000001 with final state: FINISHING, and exit status: -1000
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1780210407885_1976_000001 State change from RUNNING to FINAL_SAVING on event = UNREGISTERED
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: Updating application application_1780210407885_1976 with final state: FINISHING
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: application_1780210407885_1976 State change from RUNNING to FINAL_SAVING on event = ATTEMPT_UNREGISTERED
2026-06-05 11:17:49,433 INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.RMStateStore: Updating info for app: application_1780210407885_1976
2026-06-05 11:17:49,433 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1780210407885_1976_000001 State change from FINAL_SAVING to FINISHING on event = ATTEMPT_UPDATE_SAVED&lt;/LI-CODE&gt;</description>
    <pubDate>Fri, 05 Jun 2026 10:03:25 GMT</pubDate>
    <dc:creator>ganzuoni</dc:creator>
    <dc:date>2026-06-05T10:03:25Z</dc:date>
    <item>
      <title>Slow distcp job termination</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414193#M255201</link>
      <description>&lt;P&gt;I'm copying several directories with several files each&lt;/P&gt;&lt;P&gt;Yarn application full termination seems affected by the number of object transferred&lt;/P&gt;&lt;P&gt;In this case we have 9 minutes between 100% completion and job termination&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;26/06/05 10:47:20 INFO mapreduce.Job: map 100% reduce 0%
26/06/05 10:56:25 INFO mapreduce.Job: Job job_1780210407885_1954 completed successfully
with
Files Copied=68744
DIR_COPY=23058&lt;/LI-CODE&gt;&lt;P&gt;while here few seconds&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;26/06/05 10:43:11 INFO mapreduce.Job:  map 100% reduce 0%
26/06/05 10:43:32 INFO mapreduce.Job: Job job_1780210407885_1958 completed successfully
with
Files Copied=12300
DIR_COPY=191&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;Is the any explanation for such a behaviour?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2026 09:19:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414193#M255201</guid>
      <dc:creator>ganzuoni</dc:creator>
      <dc:date>2026-06-05T09:19:48Z</dc:date>
    </item>
    <item>
      <title>Re: Slow distcp job termination</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414194#M255202</link>
      <description>&lt;P&gt;Checking node manager logs the time spent is between container removal and container success declaration&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;2026-06-05 11:08:15,452 INFO org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl: Removed completed containers from NM context: [container_e22_1780210407885_1976_01_000013]
2026-06-05 11:17:55,672 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch: Container container_e22_1780210407885_1976_01_000001 succeeded&lt;/LI-CODE&gt;</description>
      <pubDate>Fri, 05 Jun 2026 09:31:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414194#M255202</guid>
      <dc:creator>ganzuoni</dc:creator>
      <dc:date>2026-06-05T09:31:57Z</dc:date>
    </item>
    <item>
      <title>Re: Slow distcp job termination</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414195#M255203</link>
      <description>&lt;P&gt;And checking RM there is a huge number of call to the AM , one each 10 seconds:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;2026-06-05 11:09:39,050 INFO org.apache.hadoop.yarn.server.resourcemanager.RMAuditLogger: USER=arcgis   OPERATION=AM Released Container TARGET=SchedulerApp     RESULT=SUCCESS  APPID=application_1780210407885_1976    CONTAINERID=container_e22_1780210407885_1976_01_000014  RESOURCE=&amp;lt;memory:2048, vCores:1&amp;gt;        QUEUENAME=arcgis
2026-06-05 11:09:40,102 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
2026-06-05 11:09:50,116 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
.....
2026-06-05 11:17:41,013 INFO org.apache.hadoop.yarn.server.webproxy.WebAppProxyServlet: null is accessing unchecked http://almapwrk15.data.com:34620/ws/v1/mapreduce/jobs/job_1780210407885_1976 which is the app master GUI of application_1780210407885_1976 owned by arcgis
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: Updating application attempt appattempt_1780210407885_1976_000001 with final state: FINISHING, and exit status: -1000
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1780210407885_1976_000001 State change from RUNNING to FINAL_SAVING on event = UNREGISTERED
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: Updating application application_1780210407885_1976 with final state: FINISHING
2026-06-05 11:17:49,422 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: application_1780210407885_1976 State change from RUNNING to FINAL_SAVING on event = ATTEMPT_UNREGISTERED
2026-06-05 11:17:49,433 INFO org.apache.hadoop.yarn.server.resourcemanager.recovery.RMStateStore: Updating info for app: application_1780210407885_1976
2026-06-05 11:17:49,433 INFO org.apache.hadoop.yarn.server.resourcemanager.rmapp.attempt.RMAppAttemptImpl: appattempt_1780210407885_1976_000001 State change from FINAL_SAVING to FINISHING on event = ATTEMPT_UPDATE_SAVED&lt;/LI-CODE&gt;</description>
      <pubDate>Fri, 05 Jun 2026 10:03:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414195#M255203</guid>
      <dc:creator>ganzuoni</dc:creator>
      <dc:date>2026-06-05T10:03:25Z</dc:date>
    </item>
    <item>
      <title>Re: Slow distcp job termination</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414196#M255204</link>
      <description>&lt;P&gt;Anyway, everything seems related to logs collection for JobHistory&lt;/P&gt;</description>
      <pubDate>Fri, 05 Jun 2026 10:04:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Slow-distcp-job-termination/m-p/414196#M255204</guid>
      <dc:creator>ganzuoni</dc:creator>
      <dc:date>2026-06-05T10:04:11Z</dc:date>
    </item>
  </channel>
</rss>

