<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark Java Accumulator not incrementing in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112626#M75454</link>
    <description>&lt;P&gt;I am building a runnable jar and submitting the job on Hortonworks Sandbox 2.4 using &lt;CODE&gt;spark-submit jobname.jar&lt;/CODE&gt;.&lt;/P&gt;</description>
    <pubDate>Wed, 01 Jun 2016 11:55:52 GMT</pubDate>
    <dc:creator>arunak</dc:creator>
    <dc:date>2016-06-01T11:55:52Z</dc:date>
    <item>
      <title>Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112625#M75453</link>
      <description>&lt;P&gt;I have just started taking baby steps with Spark in Java. Below is a word count program that skips any word found in a stop word list. It uses two accumulators to count the skipped and unskipped words.&lt;/P&gt;&lt;P&gt;However, the &lt;CODE&gt;System.out.println&lt;/CODE&gt; calls at the end of the program &lt;STRONG&gt;always report both accumulator values as 0&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;Please point out where I am going wrong.&lt;/P&gt;&lt;PRE&gt;public static void main(String[] args) throws FileNotFoundException {
        SparkConf conf = new SparkConf();
        conf.setAppName("Third App - Word Count WITH BroadCast and Accumulator");
        JavaSparkContext jsc = new JavaSparkContext(conf);
        JavaRDD&amp;lt;String&amp;gt; fileRDD = jsc.textFile("hello.txt");
        JavaRDD&amp;lt;String&amp;gt; words = fileRDD.flatMap(new FlatMapFunction&amp;lt;String, String&amp;gt;() {
            public Iterable&amp;lt;String&amp;gt; call(String aLine) throws Exception {
                return Arrays.asList(aLine.split(" "));
            }
        });
        String[] stopWordArray = getStopWordArray();
        final Accumulator&amp;lt;Integer&amp;gt; skipAccumulator = jsc.accumulator(0);
        final Accumulator&amp;lt;Integer&amp;gt; unSkipAccumulator = jsc.accumulator(0);
        final Broadcast&amp;lt;String[]&amp;gt; stopWordBroadCast = jsc.broadcast(stopWordArray);
        JavaRDD&amp;lt;String&amp;gt; filteredWords = words.filter(new Function&amp;lt;String, Boolean&amp;gt;() {
            public Boolean call(String inString) throws Exception {
                boolean filterCondition = !Arrays.asList(stopWordBroadCast.getValue()).contains(inString);
                if(!filterCondition){
                    System.out.println("Filtered a stop word ");
                    skipAccumulator.add(1);
                }else{
                    unSkipAccumulator.add(1);
                }
                return filterCondition;
            }
        });
        System.out.println("$$$$$$$$Filtered Count "+skipAccumulator.value());
        System.out.println("$$$$$$$$ UN Filtered Count "+unSkipAccumulator.value());
        /* rest of code - works fine */
        jsc.stop();
        jsc.close();
        }&lt;/PRE&gt;</description>
      <pubDate>Wed, 01 Jun 2016 11:54:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112625#M75453</guid>
      <dc:creator>arunak</dc:creator>
      <dc:date>2016-06-01T11:54:53Z</dc:date>
    </item>
    <item>
      <title>Re: Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112626#M75454</link>
      <description>&lt;P&gt;I am building a runnable jar and submitting the job on Hortonworks Sandbox 2.4 using &lt;CODE&gt;spark-submit jobname.jar&lt;/CODE&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 11:55:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112626#M75454</guid>
      <dc:creator>arunak</dc:creator>
      <dc:date>2016-06-01T11:55:52Z</dc:date>
    </item>
    <item>
      <title>Re: Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112627#M75455</link>
      <description>&lt;P&gt;I also tried the same program in local mode in Eclipse, using &lt;CODE&gt;JavaSparkContext jsc = new JavaSparkContext("local[*]", "Application name");&lt;/CODE&gt;, but got the same result.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 12:00:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112627#M75455</guid>
      <dc:creator>arunak</dc:creator>
      <dc:date>2016-06-01T12:00:36Z</dc:date>
    </item>
    <item>
      <title>Re: Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112628#M75456</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10529/akeezhadath.html" nodeid="10529"&gt;@akeezhadath&lt;/A&gt; It seems that you are not calling an action, so the job is never triggered. Spark transformations such as &lt;CODE&gt;filter&lt;/CODE&gt; are lazily evaluated; nothing runs until an action is invoked. Can you run an action such as &lt;CODE&gt;count&lt;/CODE&gt; or &lt;CODE&gt;collect&lt;/CODE&gt; on &lt;CODE&gt;filteredWords&lt;/CODE&gt; before printing the accumulators, and check whether their values are incremented?&lt;/P&gt;</description>
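      <content:encoded><![CDATA[
The lazy-evaluation behaviour described in this reply can be reproduced without a cluster. The following is a minimal sketch that uses the JDK's java.util.stream as an analogy, not Spark's API: the class name, word lists, and AtomicInteger counters are hypothetical stand-ins for the RDD, the broadcast stop word array, and the two accumulators. A stream's filter() is only recorded until a terminal operation runs, much as an RDD transformation waits for an action.

```java
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class LazyFilterDemo {
    // Hypothetical stop words and input, chosen only for illustration.
    static final List<String> STOP_WORDS = Arrays.asList("a", "the", "is");
    static final List<String> WORDS = Arrays.asList("the", "cat", "is", "on", "a", "mat");

    /** Returns {skipped count read before the terminal operation, skipped after, kept after}. */
    static int[] run() {
        // Plain counters standing in for the two accumulators.
        AtomicInteger skipped = new AtomicInteger();
        AtomicInteger kept = new AtomicInteger();

        // filter() is an intermediate operation: it is only recorded here, not executed.
        Stream<String> filtered = WORDS.stream().filter(word -> {
            boolean keep = !STOP_WORDS.contains(word);
            if (keep) {
                kept.incrementAndGet();
            } else {
                skipped.incrementAndGet();
            }
            return keep;
        });

        // Reading a counter now mirrors printing the accumulators before any action has run.
        int skippedBefore = skipped.get();

        // collect() is a terminal operation: it forces the filter to run over every word.
        filtered.collect(Collectors.toList());

        return new int[] { skippedBefore, skipped.get(), kept.get() };
    }

    public static void main(String[] args) {
        int[] counts = run();
        System.out.println("Skipped before terminal operation: " + counts[0]); // 0
        System.out.println("Skipped after terminal operation: " + counts[1]);  // 3
        System.out.println("Kept after terminal operation: " + counts[2]);     // 3
    }
}
```

In the Spark program from the question, the corresponding change would be to call an action such as filteredWords.count() before reading skipAccumulator.value() and unSkipAccumulator.value(). An action that can stop early, such as first(), may process only as much input as it needs, so the accumulator totals can be partial; count or collect evaluate the whole dataset.
]]></content:encoded>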
      <pubDate>Wed, 01 Jun 2016 12:16:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112628#M75456</guid>
      <dc:creator>rajkumar_singh</dc:creator>
      <dc:date>2016-06-01T12:16:07Z</dc:date>
    </item>
    <item>
      <title>Re: Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112629#M75457</link>
      <description>&lt;P&gt;Good point! Let me try that, &lt;A rel="user" href="https://community.cloudera.com/users/8919/rajkumarsingh.html" nodeid="8919"&gt;@Rajkumar Singh&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 12:16:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112629#M75457</guid>
      <dc:creator>arunak</dc:creator>
      <dc:date>2016-06-01T12:16:08Z</dc:date>
    </item>
    <item>
      <title>Re: Spark Java Accumulator not incrementing</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112630#M75458</link>
      <description>&lt;P&gt;Got it. I added an action, &lt;CODE&gt;first()&lt;/CODE&gt;, to force the job to run. And yes, the reason you mentioned, &lt;STRONG&gt;lazy evaluation in Spark&lt;/STRONG&gt;, was what stopped me.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 12:19:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-Java-Accumulator-not-incrementing/m-p/112630#M75458</guid>
      <dc:creator>arunak</dc:creator>
      <dc:date>2016-06-01T12:19:26Z</dc:date>
    </item>
  </channel>
</rss>

