About butkiz

VidyaSargur · ‎02-28-2024

@ctrl_alt_delete, I have reached out to you with further details.

hubbarja · ‎10-12-2016

You may need to check to make sure your rdd is not empty, depending on your processing empty batches within spark streaming can cause some issues. !rdd.isEmpty

Harsh J · ‎10-05-2016

While it may appear possible to do this I'd strongly recommend against it because when you'd read back a written 150 MB MOB cell, it'd give you heap utilisation problems during the RPC encoding and transfer done by the RS. Its probably better to store the larger-than-10 MB files as HDFS files and store their paths in HBase.

butkiz · ‎09-29-2016

there is: [desktop] app_blacklist= [liboozie] oozie_url=http://<hostname>:11000/oozie which i added to get it working.

butkiz · ‎09-29-2016

Hi, it works applying above configuration. But now i have a NullPointerException in my spark code (rdd.foreach): ... kafkaStream.foreachRDD(new VoidFunction<JavaPairRDD<String, byte[]>>() { public void call(JavaPairRDD<String, byte[]> rdd) throws Exception { rdd.foreach(new VoidFunction<Tuple2<String, byte[]>>() { public void call(Tuple2<String, byte[]> avroRecord) throws Exception { In local mode it works but not in yarn-cluster. Do you have any ideas in order to get it running? Best Regards, Butkiz

butkiz · ‎06-29-2016

solved: I've created an additionaly static JavaSparkContext, convert the (String) object to JavaRDD (jsc.parallelize()) and insert into HBase using "saveAsNewAPIHadoopDataset(conf)".

butkiz · ‎06-29-2016

I've solved this by adding more cpu cores local[*] and or run the job on cluster (with enough cpu core)

butkiz · ‎04-20-2016

It is ok to See no spark worker and Master roll in CM?

tavi99 · ‎04-18-2016

Using Hive Editor from Hue Web UI in 5.5.1 version. select * from tab1 ==> brings results (small table, three records) select count(*) from tab1 ==> doesn't bring any results. If i press F5 in Hue/Hive after that - ther result does appear... The same query (select count(*)) works properly from Impala, also from Beeline CLI. Restarted the browser (Chrome) and the whole cluster - didn't help. Seems to be somehow related to combination of Hive, Hue and browser... Any assistance is welcome. Thanks in advance, Avi

butkiz · ‎04-15-2016

Dear Colleages, I'm not able to change my community account email address. Can you tell me how does it works, please? Thanks in advance, Butkiz

Online	Offline
Last Visited	‎03-31-2017 01:37 AM

Member Since	‎01-05-2015 04:51 AM
Last Visited	‎03-31-2017 01:37 AM
Posts	38
Kudos received	2

Cloudera Community

Re: Hue does not know about oozie

Re: Spark Streaming insert String into HBase

Re: Spark streaming is not writing auto HBase

Re: No hive query result is displayed in hue after...

Re: change account email

Re: SparkStreaming nullPointerException on rdd.for...

Re: HBase cell size (files)

Re: Hue does not know about oozie

Re: Spark Streaming - out of memory when submit us...

Re: Spark Streaming insert String into HBase

Re: Spark streaming is not writing auto HBase

Re: Spark on Yarn Vs Stand alone?

Re: No hive query result is displayed in hue after...

Change cloudera community account email