Member since 04-25-2016 | 19 Posts | 4 Kudos Received | 0 Solutions
10-07-2016
02:42 AM
With Spark 1.6, rollup, cube, and grouping sets are not available through the query syntax, but they are available through the DataFrame API. This works:

import org.apache.spark.sql.functions.{count, countDistinct}

// Aggregate over every combination of the six grouping columns.
var agg_result = json_df
  .select("scene_id", "action_id", "classifier", "country", "os_name", "app_ver", "user_key", "device_id")
  .cube("scene_id", "action_id", "classifier", "country", "os_name", "app_ver")
  .agg(count("user_key"), countDistinct("user_key"), countDistinct("device_id"))
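In case it helps to see what cube actually computes, here is a minimal sketch in plain Scala collections (no Spark involved). It groups a toy data set by every subset of two grouping columns and reports a count and a distinct count per group. The object name `CubeSketch`, the toy rows, and the use of `None` for a rolled-up column are all assumptions made for illustration; Spark itself marks a rolled-up column with null in the result.

```scala
// Illustration only: plain Scala, not the Spark API. All names are hypothetical.
object CubeSketch {
  // Toy rows: (country, os_name, user_key)
  val rows: Seq[(String, String, String)] = Seq(
    ("DE", "ios", "u1"),
    ("DE", "android", "u2"),
    ("US", "ios", "u1"),
    ("US", "ios", "u3")
  )

  // Returns, for each group in each grouping set, (count, distinct count) of user_key.
  // A column that is aggregated away in a grouping set appears as None in the key.
  def cube(
      data: Seq[(String, String, String)]
  ): Map[(Option[String], Option[String]), (Int, Int)] = {
    // Every subset of the two grouping columns: 2^2 = 4 grouping sets.
    val masks = for {
      keepCountry <- Seq(true, false)
      keepOs      <- Seq(true, false)
    } yield (keepCountry, keepOs)

    masks.flatMap { case (keepCountry, keepOs) =>
      data
        .groupBy { case (country, os, _) =>
          (if (keepCountry) Some(country) else None,
           if (keepOs) Some(os) else None)
        }
        .map { case (key, group) =>
          val users = group.map(_._3)
          key -> (users.size, users.distinct.size)
        }
    }.toMap
  }

  def main(args: Array[String]): Unit =
    cube(rows).toSeq.sortBy(_._1.toString).foreach(println)
}
```

With six grouping columns, as in the call above, the same idea yields 2^6 = 64 grouping sets, so the result can get large quickly.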
06-14-2016
09:20 AM
Good that you fixed it. I would just read a good Hadoop book and understand the Map, Combiner, Shuffle, Reduce process in detail. After that, the majority of the markers should be pretty self-evident. https://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/1449311520/184-6666119-1311365?ie=UTF8&*Version*=1&*entries*=0
07-09-2016
06:32 PM
Did you have to restart HiveServer2 to get the changes to take effect?