Subject Author Views Posted
This is a topic with new unread messages
What is the best way to deduplicate messages coming from Kafka in Spark Streaming consumer? Assumin...
9 ‎05-27-2017 01:30 PM
This is a topic with new unread messages
Does Cloudera recommend using Structured Streaming in production for any of its latest CDH distribu...
10 ‎05-27-2017 01:26 PM
This is a topic with new unread messages
Hi there, I am running Hue 3.11 (Cloudera 5.10) and trying to get Hue Notebook working with a Kerb...
28 ‎05-26-2017 06:09 AM
This is a topic with new unread messages
I just recently setup my cluster and when I attempt to run a spark job using python I am getting er...
52 ‎05-18-2017 06:49 PM
This is a topic with new unread messages
In HIVE I have a table that was created by using pyspark. I have created the table like below. df...
49 ‎05-17-2017 10:11 AM
This is a topic with new unread messages
Requirement: Trigger a spark job from UI by user action (say submit button click). Once the spark...
72 ‎05-14-2017 11:12 PM
This is a topic with new unread messages
when spark can use the metadata (KUDU table) which is create by impala? now we using spark to ac...
65 ‎05-14-2017 09:14 PM
This is a topic with new unread messages
I have recently created two node hadoop cluster with CDH5.11.0 with Cloudera manager. It installed ...
86 ‎05-12-2017 12:22 PM
This is a topic with new unread messages
  Hi guys,   I have a spark cluster (standalone mode) , when I submit a job (or open spark-shell...
76 ‎05-11-2017 03:06 AM
This is a topic with new unread messages
Is there a way to get Hive queries to run on Spark 2.x with CDH 5.10.x or higher?   This post mak...
99 ‎05-03-2017 09:36 AM
This is a topic with new unread messages
My spark Job is submmited by oozie in hue.The spark is running in YARN-CLUSTER mode .  I am trying ...
75 ‎05-03-2017 04:24 AM
This is a topic with new unread messages
17/05/02 17:09:01 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platfor...
101 ‎05-02-2017 05:01 AM
This is a topic with new unread messages
Can a spark job running under yarn write a file not to HDFS (that works fine) but to a shared file ...
99 ‎04-28-2017 10:09 PM
This is a topic with new unread messages
Do Spark streaming support copying to MS SQL or MYSQL? I am getting error: java.lang.UnsupportedOp...
98 ‎04-27-2017 04:00 AM
This is a topic with new unread messages
I am trying to read a file and add two extra columns. 1. Seq no and 2. filename. When I run spark j...
99 ‎04-26-2017 03:38 AM
This is a topic with new unread messages
I would like to create a Grafana Dashboard for Spark Streaming jobs. I have installed all the requir...
100 ‎04-25-2017 06:04 PM
This is a topic with new unread messages
In a previous post on Multiple Spark versions, already solved, it was defined how to use multiple S...
100 ‎04-20-2017 10:13 AM
This is a topic with new unread messages
Hi, I have pyspark kernel setup and was able to see and use the kernel in JupyterHub. However, ...
99 ‎04-13-2017 04:32 PM
This is a topic with new unread messages
Hi.   SparkSQL 2.1 throws the following warning :    WARN client.Shim_v1_1: Caught Hive MetaExc...
98 ‎04-13-2017 02:06 PM
This is a topic with new unread messages
Hi gurus,   i'm  new to big data, right now i'm facing a problem. The problem is how to stream cs...
100 ‎04-11-2017 01:23 AM