1973
Posts
1225
Kudos Received
124
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
790 | 04-03-2024 06:39 AM | |
1531 | 01-12-2024 08:19 AM | |
782 | 12-07-2023 01:49 PM | |
1344 | 08-02-2023 07:30 AM | |
1947 | 03-29-2023 01:22 PM |
06-06-2016
04:05 PM
NiFi will do that very easily, then you can trigger some Spark jobs to do final processing.
... View more
06-06-2016
02:04 PM
The https://stomp.github.io/implementations.html Stomp protocol is not supported,but someone could right a processor to read and write it fairly easily as STOMP has some solid Java clients.
... View more
06-04-2016
03:59 PM
3 Kudos
Has anyone integrated Apache Airflow and HDP? It looks interesting.
... View more
Labels:
- Labels:
-
Hortonworks Data Platform (HDP)
06-04-2016
05:07 AM
1 Kudo
Any updates for 2016
... View more
06-03-2016
09:09 PM
I have a script that loads data into an ORC table, seems I can only load TEXT data type. Are other data types supported? Or is there a better way to load bulk CSV data and load into ORC tables. Thanks.
... View more
Labels:
- Labels:
-
Apache Pig
06-02-2016
02:36 PM
KryoSerializer is pretty awesome. It is a faster Java serializer. This will speed up Spark, not related to your issue, but I like to add that to all my Spark projects. When RDDs are in memory they are serialized objects. So a faster, smaller serialization will help with speed and memory.
... View more
06-02-2016
02:35 PM
Great find Ryan. Let's see if we can get that added to the tutorial.
... View more
06-02-2016
02:08 PM
You really need more cores. But 2 may work. spark-submit --class "com.stuff.Class" \
--master yarn --deploy-mode client --driver-memory 1024m --executor-memory 1024m --conf spark.ui.port=4244 MyJar.jar remove this from your code .set("spark.master","yarn-client"); add this sparkConf.set("spark.cores.max", "1") sparkConf.set("spark.serializer", classOf[KryoSerializer].getName) sparkConf.set("spark.sql.tungsten.enabled", "true") sparkConf.set("spark.eventLog.enabled", "true") sparkConf.set("spark.app.id", "YourId")
... View more
06-02-2016
01:48 PM
can you post the spark-submit you listed?
... View more
06-02-2016
01:47 PM
what does it saw on the main page for YARN resources?
... View more