Member since
05-22-2018
29
Posts
0
Kudos Received
0
Solutions
08-27-2019
02:18 PM
Hello @prathamesh_h There is great documentation on rowkey design here: https://hbase.apache.org/book.html#rowkey.design At a high level, you want to ensure that your rowkeys are as evenly distributed as possible. If you have very few sites and many articles for each site, you may not see great performance. You can consider ways to break your articles into smaller buckets within each site, and including this in your rowkey.
... View more
06-04-2019
03:56 AM
The above querstion and the entire reply thread below was originally posted in the Community Help Track. On Tue Jun 4 03:55 UTC 2019, a member of the HCC moderation staff moved it to the Data Processing track. The Community Help Track is intended for questions about using the HCC site itself.
... View more
12-04-2018
04:46 PM
Please include the version of HDP in every question you ask.
... View more
07-19-2018
04:53 AM
Not according to that Jira. It is well documented.
... View more
06-28-2018
07:29 PM
Hi @Prathamesh H! Sorry about my delay, so regarding your issue. Hmm for partitioned table, afaik, you'll have to summarize per partition, unfortunately 😞 I just heard around, that it's possible to get the size on Hive 2.0. I'm not sure, i didn't test it. One last thing, the command that i had sent to you, in this case (for partitioned table) would be: analyze table <table> partition(col1,col2) compute statistics; Hope this helps!
... View more