Member since
01-31-2017
2
Posts
1
Kudos Received
0
Solutions
01-31-2017
05:24 PM
1 Kudo
Is there any performance difference in below approches 1 )first read all queries from hive table and running all queries paraller ( with HiveContext) using java threading 2) using oozie multiple spark action and run each query individual Doubts: How does spark allocats resources in first case while running on Yarn Some thoughts In 2) we can optimize jobs by setting config but how to do optimization for all queries in 1). Any thougth about internal processing on yarn Thanks in advance
... View more
- Tags:
- HIVECONTEXT
- Spark
Labels:
- Labels:
-
Spark