<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error. in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143725#M19718</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt; What HDP version are you using? I use this &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/ch_installing_kafka_chapter.html" target="_blank"&gt;http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/ch_installing_kafka_chapter.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 18 Feb 2016 02:41:56 GMT</pubDate>
    <dc:creator>nsabharwal</dc:creator>
    <dc:date>2016-02-18T02:41:56Z</dc:date>
    <item>
      <title>I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143720#M19713</link>
      <description>&lt;P&gt;I get following error: &lt;/P&gt;&lt;PRE&gt;Exception in thread "main" java.lang.NoClassDefFoundError: scala/reflect/ClassManifest
at kafka.utils.Log4jController$.&amp;lt;init&amp;gt;(Log4jController.scala:29)
at kafka.utils.Log4jController$.&amp;lt;clinit&amp;gt;(Log4jController.scala)
at kafka.utils.Logging$class.$init$(Logging.scala:29)
at kafka.utils.VerifiableProperties.&amp;lt;init&amp;gt;(VerifiableProperties.scala:24)
at kafka.consumer.ConsumerConfig.&amp;lt;init&amp;gt;(ConsumerConfig.scala:78)
at net.mynet.ReadFileConsumer.main(ReadFileConsumer.java:42)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:665)
at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:170)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:193)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:112)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: java.lang.ClassNotFoundException: scala.reflect.ClassManifest
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
... 15 more
&lt;STRONG&gt;Here is my dependancies:
&lt;/STRONG&gt;&amp;lt;dependencies&amp;gt;
&amp;lt;dependency&amp;gt;
    &amp;lt;groupId&amp;gt;org.apache.kafka&amp;lt;/groupId&amp;gt;
&amp;lt;artifactId&amp;gt;kafka_2.9.2&amp;lt;/artifactId&amp;gt;
&amp;lt;version&amp;gt;0.8.1.1&amp;lt;/version&amp;gt;
    &amp;lt;scope&amp;gt;compile&amp;lt;/scope&amp;gt;
&amp;lt;exclusions&amp;gt;
&amp;lt;exclusion&amp;gt;
&amp;lt;artifactId&amp;gt;jmxri&amp;lt;/artifactId&amp;gt;
    &amp;lt;groupId&amp;gt;com.sun.jmx&amp;lt;/groupId&amp;gt; 
&amp;lt;/exclusion&amp;gt;
&amp;lt;exclusion&amp;gt;&amp;lt;artifactId&amp;gt;jms&amp;lt;/artifactId&amp;gt;
    &amp;lt;groupId&amp;gt;javax.jms&amp;lt;/groupId&amp;gt;&amp;lt;/exclusion&amp;gt;
&amp;lt;exclusion&amp;gt;&amp;lt;artifactId&amp;gt;jmxtools&amp;lt;/artifactId&amp;gt;
      &amp;lt;groupId&amp;gt;com.sun.jdmk&amp;lt;/groupId&amp;gt;&amp;lt;/exclusion&amp;gt;
&amp;lt;/exclusions&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;dependency&amp;gt;
      &amp;lt;groupId&amp;gt;org.slf4j&amp;lt;/groupId&amp;gt; 
      &amp;lt;artifactId&amp;gt;slf4j-log4j12&amp;lt;/artifactId&amp;gt; 
&amp;lt;version&amp;gt;1.7.5&amp;lt;/version&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;dependency&amp;gt;
    &amp;lt;groupId&amp;gt;org.apache.kafka&amp;lt;/groupId&amp;gt;
&amp;lt;artifactId&amp;gt;kafka-clients&amp;lt;/artifactId&amp;gt;
&amp;lt;version&amp;gt;0.8.2.2&amp;lt;/version&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;dependency&amp;gt;
&amp;lt;groupId&amp;gt;org.apache.spark&amp;lt;/groupId&amp;gt;
&amp;lt;artifactId&amp;gt;spark-core_2.10&amp;lt;/artifactId&amp;gt;
&amp;lt;version&amp;gt;1.4.1&amp;lt;/version&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;dependency&amp;gt;
    &amp;lt;groupId&amp;gt;org.apache.spark&amp;lt;/groupId&amp;gt;
&amp;lt;artifactId&amp;gt;spark-sql_2.10&amp;lt;/artifactId&amp;gt;
&amp;lt;version&amp;gt;1.4.1&amp;lt;/version&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;dependency&amp;gt;
&amp;lt;groupId&amp;gt;org.scala-lang&amp;lt;/groupId&amp;gt;
&amp;lt;artifactId&amp;gt;scala-library&amp;lt;/artifactId&amp;gt;
&amp;lt;version&amp;gt;2.10.4&amp;lt;/version&amp;gt;
&amp;lt;/dependency&amp;gt;
&amp;lt;/dependencies&amp;gt;&lt;/PRE&gt;&lt;PRE&gt;package net.mynet;
import kafka.consumer.Consumer;
import kafka.consumer.ConsumerConfig;
import kafka.consumer.ConsumerIterator;
import kafka.consumer.KafkaStream;
import kafka.javaapi.consumer.ConsumerConnector;
import org.apache.spark.sql.DataFrame;
import org.apache.spark.sql.SQLContext;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.SparkConf;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import java.util.concurrent.*;
public class ReadFileConsumer {
	public static void main(String[] args) {
	Properties config = new Properties();
    config.put("zookeeper.connect", "192.168.60.128:2181");
    config.put("group.id", "default");
    config.put("partition.assignment.strategy", "roundrobin");
    config.put("bootstrap.servers", "192.168.60.128:9092");
    config.put("zookeeper.session.timeout.ms", "400");
    config.put("zookeeper.sync.time.ms", "200");
   config.put("auto.commit.interval.ms", "1000");
    config.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
  config.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
    kafka.consumer.ConsumerConfig consumerConfig = new kafka.consumer.ConsumerConfig(config);
    ConsumerConnector consumerConnector = Consumer.createJavaConsumerConnector(consumerConfig);
    Map&amp;lt;String, Integer&amp;gt; topicCountMap = new HashMap&amp;lt;String, Integer&amp;gt;();
    topicCountMap.put("test", 1);
    Map&amp;lt;String, List&amp;lt;KafkaStream&amp;lt;byte[], byte[]&amp;gt;&amp;gt;&amp;gt; consumerMap = consumerConnector.createMessageStreams(topicCountMap);
    List&amp;lt;KafkaStream&amp;lt;byte[], byte[]&amp;gt;&amp;gt; streamList = consumerMap.get("test");
    KafkaStream&amp;lt;byte[], byte[]&amp;gt; stream = streamList.get(0);
    ConsumerIterator&amp;lt;byte[], byte[]&amp;gt; iterator = stream.iterator();
    while(iterator.hasNext()) {
     System.out.println(new String(iterator.next().message()));
    }
  }
  public static void processRecords(Map&amp;lt;String, ConsumerRecords&amp;lt;String, String&amp;gt;&amp;gt; records) {
	  ArrayList arr = new ArrayList();
    List&amp;lt;ConsumerRecord&amp;lt;String, String&amp;gt;&amp;gt; messages = records.get("test").records();
    if(messages != null) {
      for (ConsumerRecord&amp;lt;String, String&amp;gt; next : messages) {
        try {
   	arr.add(next.value());
          System.out.println(next.value());
        } catch (Exception e) {
          e.printStackTrace();}
      }
    SparkConf conf = new SparkConf().setAppName("Consumer").setMaster("local");
	  JavaSparkContext sc = new JavaSparkContext(conf);
      JavaRDD&amp;lt;String&amp;gt; distData = sc.parallelize(arr);
      SQLContext sqlContext = new SQLContext(sc);
      DataFrame df = sqlContext.createDataFrame(distData,null);
    } else {     System.out.println("No messages"); }
  }
}
&lt;/PRE&gt;&lt;P&gt;,&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 00:54:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143720#M19713</guid>
      <dc:creator>hoda_moradi2014</dc:creator>
      <dc:date>2016-02-17T00:54:03Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143721#M19714</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt;&lt;P&gt; please add HDP repositories to your pom, make sure to add dependencies for our versions of Spark and Kafka, 0.8.1 is old.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/626/how-do-i-resolve-maven-dependencies-for-building-a.html" target="_blank"&gt;https://community.hortonworks.com/questions/626/how-do-i-resolve-maven-dependencies-for-building-a.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 00:56:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143721#M19714</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-17T00:56:17Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143722#M19715</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; spark version is 1.4.1 and my kafka is 0.8.2.2 and I add the right versions. I still have the same error.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 01:09:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143722#M19715</guid>
      <dc:creator>hoda_moradi2014</dc:creator>
      <dc:date>2016-02-17T01:09:48Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143723#M19716</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt;&lt;P&gt;&lt;A href="http://spark.apache.org/docs/latest/streaming-kafka-integration.html"&gt;http://spark.apache.org/docs/latest/streaming-kafka-integration.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 00:34:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143723#M19716</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-02-18T00:34:57Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143724#M19717</link>
      <description>&lt;P&gt;Thank you for your response. I realized that I cannot even run a simple kafka consumer program. Even with out using spark. I still get this error java.lang.NoClassDefFoundError: scala/reflect/ClassManifest. Do you have any idea why? I checked all of my version are correct.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 02:33:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143724#M19717</guid>
      <dc:creator>hoda_moradi2014</dc:creator>
      <dc:date>2016-02-18T02:33:11Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143725#M19718</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt; What HDP version are you using? I use this &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/ch_installing_kafka_chapter.html" target="_blank"&gt;http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.4/bk_installing_manually_book/content/ch_installing_kafka_chapter.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 02:41:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143725#M19718</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-02-18T02:41:56Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143726#M19719</link>
      <description>&lt;P&gt;HDP version is the latest one 2.3.2. I use that too but, when I try to run a simple consumer program in java, it does not work. I use kaka.consumer package in my code. My producer code works fine now I want to get those data and do some analysis on them in my consumer code.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 02:54:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143726#M19719</guid>
      <dc:creator>hoda_moradi2014</dc:creator>
      <dc:date>2016-02-18T02:54:36Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143727#M19720</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt;  I have used this in the past &lt;A href="http://hortonworks.com/hadoop-tutorial/ingesting-processing-real-time-events-apache-storm/"&gt;http://hortonworks.com/hadoop-tutorial/ingesting-processing-real-time-events-apache-storm/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It has kafka code too. Creative ideas ... &lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 03:04:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143727#M19720</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-02-18T03:04:47Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143728#M19721</link>
      <description>&lt;P&gt;not sure which IDE you're using but with Netbeans, you can add the base repository and then you can use find dependencies which will do a search on Maven Central. You can find all deps that way. &lt;A rel="user" href="https://community.cloudera.com/users/2833/hodamoradi2014.html" nodeid="2833"&gt;@hoda moradi&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 04:46:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143728#M19721</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-18T04:46:15Z</dc:date>
    </item>
    <item>
      <title>Re: I just start learning kafka and spark. I am trying to run a simple program for kafka consumer which saves data in spark DataFrame. But, I get the following error.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143729#M19722</link>
      <description>&lt;P&gt;I solved that error by adding this dependency to my project.&lt;/P&gt;&lt;P&gt;&amp;lt;dependency&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;groupId&amp;gt;log4j&amp;lt;/groupId&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;artifactId&amp;gt;log4j&amp;lt;/artifactId&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;version&amp;gt;1.2.17&amp;lt;/version&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;lt;/dependency&amp;gt;&lt;/P&gt;</description>
      <pubDate>Sun, 21 Feb 2016 07:34:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-just-start-learning-kafka-and-spark-I-am-trying-to-run-a/m-p/143729#M19722</guid>
      <dc:creator>hoda_moradi2014</dc:creator>
      <dc:date>2016-02-21T07:34:25Z</dc:date>
    </item>
  </channel>
</rss>

