<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Mapreduce - GC overhead limit exceeded in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146204#M32346</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;When we run a MapReduce job, we get a "GC overhead limit exceeded" error during the map phase and the job is terminated. How can this be resolved?&lt;/P&gt;&lt;P&gt;Error: GC overhead limit exceeded
16/06/19 17:34:39 INFO mapreduce.Job:  map 18% reduce 0%
16/06/19 17:36:42 INFO mapreduce.Job:  map 19% reduce 0%
16/06/19 17:37:18 INFO mapreduce.Job: Task Id : attempt_1466342436828_0001_m_000008_2, Status : FAILED
Error: Java heap space&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Venkadesh S&lt;/P&gt;</description>
    <pubDate>Sun, 19 Jun 2016 21:12:38 GMT</pubDate>
    <dc:creator>vsivalingam</dc:creator>
    <dc:date>2016-06-19T21:12:38Z</dc:date>
    <item>
      <title>Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146204#M32346</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;When we run a MapReduce job, we get a "GC overhead limit exceeded" error during the map phase and the job is terminated. How can this be resolved?&lt;/P&gt;&lt;P&gt;Error: GC overhead limit exceeded
16/06/19 17:34:39 INFO mapreduce.Job:  map 18% reduce 0%
16/06/19 17:36:42 INFO mapreduce.Job:  map 19% reduce 0%
16/06/19 17:37:18 INFO mapreduce.Job: Task Id : attempt_1466342436828_0001_m_000008_2, Status : FAILED
Error: Java heap space&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Venkadesh S&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 21:12:38 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146204#M32346</guid>
      <dc:creator>vsivalingam</dc:creator>
      <dc:date>2016-06-19T21:12:38Z</dc:date>
    </item>
    <item>
      <title>Re: Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146205#M32347</link>
      <description>&lt;P&gt;It looks like the heap configured in mapred.child.java.opts is too small for this job. Try increasing the mapred.child.java.opts value and running the job again.&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 21:22:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146205#M32347</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-06-19T21:22:00Z</dc:date>
    </item>
    <item>
      <title>Re: Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146206#M32348</link>
      <description>&lt;P&gt;It sounds more like your map task is not very efficient. What are you doing in it? The other possible cause I can see is sort memory that is too small, but I would mostly look at your map code.&lt;/P&gt;&lt;P&gt;&lt;A href="http://stackoverflow.com/questions/5839359/java-lang-outofmemoryerror-gc-overhead-limit-exceeded" target="_blank"&gt;http://stackoverflow.com/questions/5839359/java-lang-outofmemoryerror-gc-overhead-limit-exceeded&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 21:23:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146206#M32348</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-06-19T21:23:52Z</dc:date>
    </item>
    <item>
      <title>Re: Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146207#M32349</link>
      <description>&lt;P&gt;Thanks, &lt;A href="https://community.hortonworks.com/questions/40612/mapreduce-gc-overhead-limit-exceeded.html#"&gt;@Rajkumar Singh&lt;/A&gt; and &lt;A href="https://community.hortonworks.com/questions/40612/mapreduce-gc-overhead-limit-exceeded.html#"&gt;@Benjamin Leonhardi&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Below are my settings in the cluster.&lt;/P&gt;&lt;P&gt;Map Memory : 8192&lt;/P&gt;&lt;P&gt;Sort Allocation Memory : 2047&lt;/P&gt;&lt;P&gt;MR Map Java Heap Size : -Xmx8192m&lt;/P&gt;&lt;P&gt;mapreduce.admin.map.child.java.opts &amp;amp; mapred.child.java.opts : -server -XX:NewRatio=8 -Djava.net.preferIPv4Stack=true -Dhdp.version=${hdp.version}&lt;/P&gt;&lt;P&gt;I haven't found mapred.child.java.opts in Ambari.&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 21:32:40 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146207#M32349</guid>
      <dc:creator>vsivalingam</dc:creator>
      <dc:date>2016-06-19T21:32:40Z</dc:date>
    </item>
    <item>
      <title>Re: Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146208#M32350</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; I am trying to read a CSV file with a total size of around 50 GB. Around 310 splits are created, but only 3 map tasks run at a time even though I have four datanodes. Each datanode has 16 GB RAM, one disk, and &lt;STRONG&gt;Cores (CPU):&lt;/STRONG&gt; 2 (2). I am using CSVNLineInputFormat from (https://github.com/mvallebr/CSVInputFormat/blob/master/src/main/java/org/apache/hadoop/mapreduce/lib/input/CSVNLineInputFormat.java) to read my CSV files.&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 21:39:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146208#M32350</guid>
      <dc:creator>vsivalingam</dc:creator>
      <dc:date>2016-06-19T21:39:21Z</dc:date>
    </item>
    <item>
      <title>Re: Mapreduce - GC overhead limit exceeded</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146209#M32351</link>
      <description>&lt;P&gt;Normally mappers don't fail with OOM, and 8192M is a generous heap. I suspect you have some very large records in the CSV, or are you doing some memory-intensive operation inside the mapper? Could you please share the task log for the attempt attempt_1466342436828_0001_m_000008_2?&lt;/P&gt;</description>
      <pubDate>Sun, 19 Jun 2016 22:02:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Mapreduce-GC-overhead-limit-exceeded/m-p/146209#M32351</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-06-19T22:02:35Z</dc:date>
    </item>
  </channel>
</rss>

