Member since
01-19-2018
4
Posts
0
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1147 | 07-30-2019 07:51 AM |
01-15-2023
11:36 PM
Hello @prakodi, For CDH 6.3, you can review this article https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/cm_bdr_hive_replication.html Hope this helps, Tarun Was your question answered? Make sure to mark the answer as the accepted solution. If you find a reply useful, say thanks by clicking on the thumbs-up button.
... View more
08-06-2019
06:05 AM
Thanks for the reply ! Views are already created by joining many underlying table, hence joining the views again for data aggregation will result performance issue. Here are the two approach i came up with 1. Extract data from Hive view into files. 2. Create intermediate Hive tables and load data extracted from views. 3. Join the new hive tables to generate the final file. Another approach to use PySpark to read data from views directly , aggreate and transform the data and generate the final output file.
... View more
01-19-2018
11:47 PM
https://stackoverflow.com/questions/46857090/adding-pyspark-python-path-in-oozie
... View more