Member since
01-05-2015
38
Posts
2
Kudos Received
4
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
5579 | 09-29-2016 03:25 AM | |
2639 | 06-29-2016 04:34 AM | |
2210 | 06-29-2016 04:28 AM | |
11586 | 04-15-2016 12:32 AM |
02-28-2024
09:51 PM
@ctrl_alt_delete, I have reached out to you with further details.
... View more
10-12-2016
06:21 PM
You may need to check to make sure your rdd is not empty, depending on your processing empty batches within spark streaming can cause some issues. !rdd.isEmpty
... View more
10-05-2016
12:57 AM
While it may appear possible to do this I'd strongly recommend against it because when you'd read back a written 150 MB MOB cell, it'd give you heap utilisation problems during the RPC encoding and transfer done by the RS. Its probably better to store the larger-than-10 MB files as HDFS files and store their paths in HBase.
... View more
09-29-2016
06:12 AM
there is: [desktop] app_blacklist= [liboozie] oozie_url=http://<hostname>:11000/oozie which i added to get it working.
... View more
09-29-2016
06:07 AM
Hi, it works applying above configuration. But now i have a NullPointerException in my spark code (rdd.foreach): ... kafkaStream.foreachRDD(new VoidFunction<JavaPairRDD<String, byte[]>>() {
public void call(JavaPairRDD<String, byte[]> rdd) throws Exception {
rdd.foreach(new VoidFunction<Tuple2<String, byte[]>>() {
public void call(Tuple2<String, byte[]> avroRecord) throws Exception { In local mode it works but not in yarn-cluster. Do you have any ideas in order to get it running? Best Regards, Butkiz
... View more
06-29-2016
04:34 AM
solved: I've created an additionaly static JavaSparkContext, convert the (String) object to JavaRDD (jsc.parallelize()) and insert into HBase using "saveAsNewAPIHadoopDataset(conf)".
... View more
06-29-2016
04:28 AM
I've solved this by adding more cpu cores local[*] and or run the job on cluster (with enough cpu core)
... View more
04-18-2016
01:46 AM
Using Hive Editor from Hue Web UI in 5.5.1 version. select * from tab1 ==> brings results (small table, three records) select count(*) from tab1 ==> doesn't bring any results. If i press F5 in Hue/Hive after that - ther result does appear... The same query (select count(*)) works properly from Impala, also from Beeline CLI. Restarted the browser (Chrome) and the whole cluster - didn't help. Seems to be somehow related to combination of Hive, Hue and browser... Any assistance is welcome. Thanks in advance, Avi
... View more
04-15-2016
12:40 AM
1 Kudo
Dear Colleages, I'm not able to change my community account email address. Can you tell me how does it works, please? Thanks in advance, Butkiz
... View more