Member since: 01-11-2017
Posts: 65
Kudos Received: 1
Solutions: 0
05-21-2019
09:26 PM
@JT Ng The Oozie Spark action is not supported; use an Oozie shell action to submit the Spark job instead.
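For reference, a minimal sketch of the script such a shell action could invoke (the master, deploy mode, class, and jar path are placeholders, not from this thread):
#!/bin/bash
# submit-spark.sh - called by the Oozie shell action; pass any job arguments through
spark-submit \
  --master yarn \
  --deploy-mode cluster \
  --class com.example.MyApp \
  /path/to/my-app.jar "$@"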
... View more
12-26-2018
06:09 PM
@Saurav Ranjit please collect the YARN logs and attach them here so we can debug further. Use the command below to collect the logs: yarn logs -applicationId application_1545806970486_0001
... View more
09-10-2018
02:43 PM
@kgautam we already tried running MSCK repair and analyze table before posting this issue here, thanks.
... View more
09-08-2018
05:11 PM
Hive is returning different results depending on how we filter the partition column. When we query specific partitions by listing the dates, for example date_dt IN (DATE'2018-03-31', DATE'2018-05-19', DATE'2018-04-28', DATE'2018-03-17'), the results are correct, but when we query a date range, for example date_dt BETWEEN DATE'2017-07-08' AND DATE'2018-07-07', the results are incorrect.
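For illustration, the two query shapes amount to something like this (the table name sales_fact is hypothetical; the predicates are the ones above):
hive -e "SELECT count(*) FROM sales_fact WHERE date_dt IN (DATE'2018-03-31', DATE'2018-05-19', DATE'2018-04-28', DATE'2018-03-17');"   # correct results
hive -e "SELECT count(*) FROM sales_fact WHERE date_dt BETWEEN DATE'2017-07-08' AND DATE'2018-07-07';"                                 # incorrect results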
... View more
Labels: Apache Hive
09-06-2018
02:43 PM
@Jay Kumar SenSharma is there a way we can get the "Average percentage of local files to Region server in the cluster" metric for the entire HBase cluster instead of querying each region server? Thanks.
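For context, the per-region-server figure can be pulled from each region server's JMX servlet, roughly like this (a sketch, assuming the default info port 16030; older deployments use 60030):
curl -s "http://<regionserver-host>:16030/jmx?qry=Hadoop:service=HBase,name=RegionServer,sub=Server" | grep -i percentFilesLocal
Averaging that value across the cluster still means querying every region server, which is what we are trying to avoid.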
... View more
09-05-2018
04:02 AM
@Jay Kumar SenSharma thanks for your response. I'm looking for the "Average percentage of local files to Region server in the cluster". Is there any other way, apart from Ambari and Grafana, to pull this directly from HBase? We don't use Ambari or Grafana. Thanks.
... View more
09-05-2018
02:53 AM
Hello Everyone, is there any API/curl command to pull HBase data locality? We can monitor or view data locality from Ambari; without Ambari, is there any easy way to find data locality in HBase? HBase version: 1.1.2.2.5.3.0-37, HDP version: 2.5.3. Thanks.
... View more
06-26-2018
02:50 PM
@Vinicius Higa Murakami Thanks for your reply. When vectorization is enabled, does it corrupt timestamp values every time, or is it intermittent? With vectorization enabled it used to work fine; we only started seeing this recently.
... View more
06-25-2018
04:58 PM
hive-logs.txt Hello Everyone, a Hive INSERT query into a partitioned table is failing with vectorization enabled. It was working fine until last week; however, it runs fine with vectorization disabled. Can someone help identify the issue? Please find the attached logs with vectorization enabled and with it disabled/turned off.
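For reference, disabling vectorization for a run is typically done like this (a sketch; the hiveconf switches are the standard vectorization settings and the script name is a placeholder):
hive --hiveconf hive.vectorized.execution.enabled=false \
     --hiveconf hive.vectorized.execution.reduce.enabled=false \
     -f failing_insert.hql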
... View more
06-20-2018
05:56 PM
screen-shot-2018-06-20-at-125515-pm.png @anarasimham thanks for your reply. The mdse_item_i data type is int (`mdse_item_i` int), and the output is attached as a screenshot.
... View more
06-20-2018
03:41 AM
screen-shot-2018-06-19-at-104245-pm.png countsdistinct-1.txt doesnotcountdistinct.txt Count distinct doesn't always give me the right answer. I've attached two different queries that should both result in 7 unique items purchased. If I don't do an operation on mdse_item_i, like casting it to a bigint, it doesn't always count them correctly. To explain it simply: when I cast mdse_item_i it gives 7 unique results, but when I don't cast it, it gives 10 unique results, which is not correct. hive> select * from dfr_distinct;
OK
100000000938 5 7 12.33 2 2.75 4.27 8.060 2 8 0
Time taken: 0.479 seconds, Fetched: 1 row(s)
hive> select * from dfr_distinctnot;
OK
100000000938 5 10 12.33 2 2.75 4.27 8.06 0 2 8 0
Time taken: 0.932 seconds, Fetched: 1 row(s)
I tried running the query in both MR and Tez modes; it still gives the same results when I don't do the casting.
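To make the comparison concrete, the two query shapes amount to something like this (the table name purchases is hypothetical; only the cast differs):
hive -e "SELECT count(DISTINCT mdse_item_i) FROM purchases;"                    # returns 10 (incorrect)
hive -e "SELECT count(DISTINCT cast(mdse_item_i AS bigint)) FROM purchases;"    # returns 7 (expected)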
... View more
06-18-2018
03:56 AM
@Tejaswi Chandra check your queue's resource utilization from the RM UI.
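If the RM UI is not handy, the same numbers can be pulled from the ResourceManager REST API (a sketch, assuming the default RM web port 8088):
curl -s "http://<rm-host>:8088/ws/v1/cluster/scheduler"   # per-queue capacity, used capacity and pending apps
curl -s "http://<rm-host>:8088/ws/v1/cluster/metrics"     # cluster-wide memory/vcore totals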
... View more
06-18-2018
03:53 AM
@Prakash Punj clean up any stale PID files for the NodeManager. If the same server also hosts a RegionServer, try stopping the RS, starting the NM first, and then starting the RS. Also look for any possible zombie processes.
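A rough sketch of the stale-PID check (the path below is the typical HDP default and may differ on your hosts):
PIDFILE=/var/run/hadoop-yarn/yarn/yarn-yarn-nodemanager.pid
if [ -f "$PIDFILE" ] && ! ps -p "$(cat "$PIDFILE")" > /dev/null 2>&1; then
  echo "stale NodeManager pid file, removing"   # the process is gone but the pid file was left behind
  rm -f "$PIDFILE"
fi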
... View more
03-07-2018
11:21 PM
Hello HCC, we have a production cluster with 800+ data nodes and Hive clients installed on more than 200 nodes; I can see that from Ambari. Is there a simple way or curl command to find all hosts in the cluster with the Hive client installed, so I can see it from the CLI/Linux box? HDP: 2.6.1, Ambari: 2.5.2.0. Thanks, Jalender.
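One option worth trying is the Ambari REST API, which can list host components filtered by component name; a sketch (credentials and hostnames are placeholders):
curl -s -u <ambari-admin-user>:<ambari-admin-password> -H 'X-Requested-By: ambari' \
  "http://AMBARI_SERVER_HOST:8080/api/v1/clusters/CLUSTER_NAME/host_components?HostRoles/component_name=HIVE_CLIENT&fields=HostRoles/host_name"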
... View more
Labels: Apache Ambari, Apache Hive
12-14-2017
08:36 PM
Hello HCC, we have a 16-node cluster running HDP 2.5.3.0 and Ambari 2.5.1.0. On this cluster we allocated a 4 GB ResourceManager heap, and even when not a single job has run in the cluster for 5-6 hours, ResourceManager heap usage always sits at 80-85%. We have another, bigger production cluster with 800 data nodes, and on that cluster RM heap usage always stays under 25-30%. What determines heap usage on the cluster, and how can we control heap usage when the cluster is idle? I also checked the property "yarn.resourcemanager.max-completed-applications", which is hard coded to "${yarn.resourcemanager.max-completed-applications}", but that referenced property is not defined anywhere in Ambari. Thanks in advance, HCC.
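For reference, when that property is overridden it normally carries an explicit number (the stock default is 10000); a lower value limits how many completed applications the RM keeps in memory, which is one of the things that holds RM heap even when the cluster is idle. A sketch of an override in custom yarn-site via Ambari (the value is only illustrative):
yarn.resourcemanager.max-completed-applications=1000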
... View more
Labels: Apache YARN, Cloudera Manager
11-21-2017
10:19 PM
Hello HCC, can someone please help with how to reset the password for SmartSense -> Activity Explorer -> Zeppelin? We lost the password for the Zeppelin login page of Activity Explorer. Thanks in advance.
... View more
11-09-2017
09:04 PM
Hello HCC, is there a way to check the permissions for a user called 'xyz' in the HBase shell, and also how do we check the permissions granted on an HBase table? Thanks in advance.
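For reference, a sketch of how this is usually checked from the HBase shell (it requires HBase authorization/the AccessController coprocessor to be enabled; the table name is a placeholder):
echo "user_permission 'my_table'" | hbase shell   # users and permissions granted on a table
echo "scan 'hbase:acl'" | hbase shell             # raw ACL entries, which can be grepped for user 'xyz'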
... View more
11-03-2017
01:19 AM
Please try installing it with the Ambari REST API, like below:
curl --user <ambari-admin-user>:<ambari-admin-password> -i -H 'X-Requested-By: ambari' -X POST http://AMBARI_SERVER_HOST:8080/api/v1/clusters/CLUSTER_NAME/hosts/NEW_HOST_ADDED/host_components/SPARK_CLIENT
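If the component is added but not yet installed, the install step is typically triggered with a follow-up PUT (a sketch, using the same placeholders as above):
curl --user <ambari-admin-user>:<ambari-admin-password> -i -H 'X-Requested-By: ambari' -X PUT \
  -d '{"HostRoles": {"state": "INSTALLED"}}' \
  http://AMBARI_SERVER_HOST:8080/api/v1/clusters/CLUSTER_NAME/hosts/NEW_HOST_ADDED/host_components/SPARK_CLIENT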
... View more
10-28-2017
01:22 AM
Log messages from one of the servers where we are observing this behavior:
2017-10-27 21:20:43,991 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl: Memory usage of ProcessTree 10059 for container-id container_e746_1508665985104_313505_01_002805: -1B of 4 GB physical memory used; -1B of 8.4 GB virtual memory used
2017-10-27 21:20:44,049 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl: Memory usage of ProcessTree 108356 for container-id container_e746_1508665985104_313505_01_002168: 1.2 MB of 4 GB physical memory used; 103.6 MB of 8.4 GB virtual memory used
2017-10-27 21:20:44,105 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl: Memory usage of ProcessTree 13033 for container-id container_e746_1508665985104_304789_01_002499: -1B of 4 GB physical memory used; -1B of 8.4 GB virtual memory used
... View more
10-28-2017
01:16 AM
Hello HCC, we recently upgraded our prod and all dev clusters from HDP 2.5.3.0 to HDP 2.6.1.0, and we are observing weird behavior in HDP 2.6.1.0: some nodes are getting a very high allocation of containers, which causes a very high average load on the server and, in turn, causes those nodes to go into the heartbeat-lost state. When a node hits a very high average load, the NN marks it as a dead node, whereas the RM still keeps assigning containers to it (we know the RM and NN work independently), and all those containers cause jobs to go into a failed state. Every time we hit this issue we ask our SA team to reboot those servers to alleviate it; we did not have this behavior with HDP 2.5.3.0. Please find the screenshots for reference, where nodes got a very high number of containers and load average. Present versions: HDP 2.6.1.0 and Ambari 2.5.2.0. @Kuldeep Kulkarni @Jay SenSharma @Artem Ervits @ssathish ss-1.png ss-2.png ss-3.png ss-4.png
... View more
10-27-2017
06:25 PM
Hi @Jay SenSharma, we are already on HDP 2.6.1.0 and Ambari 2.5.2.0. As you mentioned, the Fix Version/s are HDP-2.6.0.0 and Ambari-2.5.0, so I'm not sure why we hit the issue on HDP 2.6.1. Thanks, Jalender.
... View more
10-26-2017
03:10 AM
Miguel Marquez Please try installing it with the Ambari REST API, like below:
curl --user <ambari-admin-user>:<ambari-admin-password> -i -H 'X-Requested-By: ambari' -X POST http://AMBARI_SERVER_HOST:8080/api/v1/clusters/CLUSTER_NAME/hosts/NEW_HOST_ADDED/host_components/SPARK2_CLIENT
... View more
10-26-2017
01:28 AM
Hello HCC, we recently upgraded our cluster from HDP 2.5.3.0 to HDP 2.6.1.0 and Ambari from 2.4.2 to 2.5.2, and we observed that after the HDP upgrade Pig's default execution engine changed from MR to Tez. Is this a bug or expected behavior? I went through the HDP 2.6.1.0 release notes, but this change is not mentioned anywhere. Can someone please clarify this change? 🙂 Thanks, Jalender. @Orlando Teixeira @Kuldeep Kulkarni @kkanchu
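For anyone needing the old behavior in the meantime, the engine can still be selected per run (a sketch; the script name is a placeholder):
pig -x mapreduce my_script.pig   # force the MapReduce engine
pig -x tez my_script.pig         # the Tez engine, now the default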
... View more
06-19-2017
04:06 AM
This is a known issue (a regression bug in the Hive UNION logic in HDP 2.5.x): https://issues.apache.org/jira/browse/HIVE-16050. The behaviour was introduced/changed by https://issues.apache.org/jira/browse/HIVE-14251, which was part of the HDP 2.5.x release. The workaround is to use 'cast' while running the query, for example: select * from (select case when browser_type is not null THEN 'OPEN' ELSE NULL end as action_code from mcom.esp_responsys_opened_stg union all select cast(null as string) as action_code from mcom.esp_responsys_complaint_stg_tmp) a limit 10;
... View more
06-05-2017
02:23 AM
The issue was resolved by setting the property below: set tez.grouping.max-size=268435456;
... View more
05-19-2017
05:17 PM
Our jobs are failing with the exception below. We are using HDP 2.5.3.0, Tez 0.7.0.2.5, Hive 1.2.1.2.5, and Ambari 2.4.0.
Status: Failed
Vertex failed, vertexName=Reducer 2, vertexId=vertex_**, diagnostics=[Exception in VertexManager, vertex:vertex_ [Reducer 2],org.apache.tez.dag.api.TezUncheckedException: Atleast 1 bipartite source should exist
at org.apache.tez.dag.library.vertexmanager.ShuffleVertexManager.onVertexStarted(ShuffleVertexManager.java:456)
at org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerEventOnVertexStarted.invoke(VertexManager.java:587)
at org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerEvent$1.run(VertexManager.java:658)
at org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerEvent$1.run(VertexManager.java:653)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724)
at org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerEvent.call(VertexManager.java:653)
at org.apache.tez.dag.app.dag.impl.VertexManager$VertexManagerEvent.call(VertexManager.java:642)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
, Vertex did not succeed due to AM_USERCODE_FAILURE, failedTasks:0 killedTasks:1, Vertex vertex_** [Reducer 2] killed/failed due to:AM_USERCODE_FAILURE]
Vertex did not succeed due to AM_USERCODE_FAILURE, failedTasks:0 killedTasks:1, Vertex vertex_ [Reducer 2] killed/failed due to:AM_USERCODE_FAILURE]Vertex killed, vertexName=Reducer 3, vertexId=vertex_, diagnostics=[Vertex received Kill in INITED state., Vertex vertex_ [Reducer 3] killed/failed due to:OTHER_VERTEX_FAILURE]Vertex killed, vertexName=Map 4, vertexId=vertex_, diagnostics=[Vertex received Kill in INITED state., Vertex vertex_ [Map 4] killed/failed due to:OTHER_VERTEX_FAILURE]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 killedVertices:2
After Staging execution
Is this related to https://issues.apache.org/jira/browse/TEZ-3239 or https://issues.apache.org/jira/browse/TEZ-2107? Thanks.
... View more
03-06-2017
11:20 PM
Vertex killed, vertexName=scope-492, vertexId=vertex_1486843207585_727267_1_09, diagnostics=[Vertex received Kill in INITED state., Vertex vertex_1486843207585_727267_1_09 [scope-492] killed/failed due to:OTHER_VERTEX_FAILURE]
Vertex killed, vertexName=scope-479, vertexId=vertex_1486843207585_727267_1_07, diagnostics=[Vertex received Kill while in RUNNING state., Vertex did not succeed due to OTHER_VERTEX_FAILURE, failedTasks:0 killedTasks:999, Vertex vertex_1486843207585_727267_1_07 [scope-479] killed/failed due to:OTHER_VERTEX_FAILURE]
Vertex killed, vertexName=scope-468, vertexId=vertex_1486843207585_727267_1_00, diagnostics=[Vertex received Kill while in RUNNING state., Vertex did not succeed due to OTHER_VERTEX_FAILURE, failedTasks:0 killedTasks:22, Vertex vertex_1486843207585_727267_1_00 [scope-468] killed/failed due to:OTHER_VERTEX_FAILURE]
Vertex killed, vertexName=scope-478, vertexId=vertex_1486843207585_727267_1_06, diagnostics=[Vertex received Kill while in RUNNING state., Vertex did not succeed due to OTHER_VERTEX_FAILURE, failedTasks:0 killedTasks:322, Vertex vertex_1486843207585_727267_1_06 [scope-478] killed/failed due to:OTHER_VERTEX_FAILURE]
DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 killedVertices:20
... View more