Member since
10-31-2017
2
Posts
0
Kudos Received
0
Solutions
10-31-2017
10:12 PM
Thanks for answering , However I have one more question here, In the existed HiveQL we are creating some temporary hive work tables to store the intermediate data and dropping those off at the end. In Spark QL instead of those temporary tables I can use createOrReplaceTempView("XXXX") to create a temporary in memory view . At any point while my data is growing what happens if this TempView can't fit in the memory. Will my job fails ? What I need to do to tackle these kind of scenarios? Appreciate your reply !!!
... View more
10-31-2017
04:45 PM
We have different tables in Hive and we are processing the data using HQL which includes some complex joins between multiple tables and on multiple conditions. Now we are planning to migrate to Spark 2.0.2. so can I use sparkSQL and use the same HQL query? or do I need to get the data to different DataFrames first and perform Joins and apply other operations on DataFrames instead of doing HQL? what is the better approach?
... View more
Labels:
- Labels:
-
Apache Spark