About jeesim2

jeesim2 · ‎10-07-2016

With Spark 1.6, rollup/cube/grouping sets are not possible through the query syntax, but they are possible through the DataFrame API. This works:

import org.apache.spark.sql.functions.{count, countDistinct}

// Aggregate over every combination of the six dimension columns.
val agg_result = json_df
  .select("scene_id", "action_id", "classifier", "country", "os_name", "app_ver", "user_key", "device_id")
  .cube("scene_id", "action_id", "classifier", "country", "os_name", "app_ver")
  .agg(count("user_key"), countDistinct("user_key"), countDistinct("device_id"))
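For anyone unsure what the cube call produces, here is a rough plain-Scala sketch of the idea, with no Spark involved. It is only an illustration, not how Spark implements it: the `Event` class, the sample rows, and the reduction to two dimensions (`sceneId`, `country`) are all made up for the example. A cube aggregates once per subset of the dimension columns, and `None` here stands in for the null that Spark puts in a rolled-up column.

```scala
// Plain-Scala sketch of what cube(...) computes; no Spark needed.
// Event and the sample rows are hypothetical illustration data.
case class Event(sceneId: String, country: String, userKey: String)

val events = Seq(
  Event("s1", "KR", "u1"),
  Event("s1", "KR", "u2"),
  Event("s1", "US", "u1"),
  Event("s2", "KR", "u3")
)

// A cube over (sceneId, country) aggregates once per subset of the dimensions.
// None plays the role of the null Spark puts in a rolled-up column.
val groupings: Seq[Event => (Option[String], Option[String])] = Seq(
  e => (Some(e.sceneId), Some(e.country)), // both dimensions kept
  e => (Some(e.sceneId), None),            // country rolled up
  e => (None, Some(e.country)),            // sceneId rolled up
  e => (None, None)                        // grand total
)

// For each grouping key: (row count, distinct user count), analogous to
// count("user_key") and countDistinct("user_key") in the post above.
val cubeResult: Map[(Option[String], Option[String]), (Int, Int)] =
  groupings.flatMap { key =>
    events.groupBy(key).map { case (k, rows) =>
      (k, (rows.size, rows.map(_.userKey).distinct.size))
    }
  }.toMap
```

With these four sample rows the result has eight entries. The grand-total key `(None, None)` maps to `(4, 3)`: four rows, three distinct users.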

bleonhardi · ‎06-14-2016

Good that you fixed it. I would just read a good Hadoop book and understand the Map/Combiner/Shuffle/Reduce process in detail. After that, the majority of markers should be pretty self-evident. https://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/1449311520/184-6666119-1311365?ie=UTF8&*Version*=1&*entries*=0

Carol at Waterline · ‎07-09-2016

Did you have to restart HiveServer2 to get the changes to take effect?

Last Visited	‎09-12-2017 08:51 AM

Member Since	‎04-25-2016 12:21 AM
Posts	19
Kudos received	4

Cloudera Community

Re: Combinational agg over multi-dimensional table

Re: Explanation of Tez task counters.

Re: Ambari hive's "hive.aux.jars.path" configurati...