Member since: 03-29-2020
Posts: 110
Kudos Received: 10
Solutions: 16
My Accepted Solutions
Title | Views | Posted |
---|---|---|
 | 851 | 01-08-2022 07:17 PM |
 | 2689 | 09-22-2021 09:39 AM |
 | 12353 | 09-14-2021 04:21 AM |
 | 2337 | 09-01-2021 10:28 PM |
 | 2863 | 08-31-2021 08:04 PM |
09-21-2021
07:59 AM
Thanks @Shelton. Reading the documentation, I found these limitations:
- Replicating to and from HDP to Cloudera Manager 7.x is not supported by Replication Manager.
- The only options I saw:
  - Use DistCp to replicate the data.
  - Hive external tables (see the sketch below).
For more information on replicating data, contact Cloudera Support. Reference: https://docs.cloudera.com/cdp/latest/data-migration/topics/cdpdc-compatibility-matrix-bdr.html
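Since Replication Manager cannot be used here, here is a minimal sketch of the external-table option, assuming the table files have already been copied to the target cluster with DistCp. The database, table, columns, and HDFS path below are hypothetical and not taken from this thread.

```sql
-- Hypothetical sketch: recreate the Hive external table on the target cluster
-- over the HDFS path that DistCp copied the files into.
CREATE EXTERNAL TABLE IF NOT EXISTS db1.events (
  id   INT,
  name STRING
)
PARTITIONED BY (ds STRING)
STORED AS PARQUET
LOCATION '/data/replicated/db1/events';

-- Register the partitions that arrived with the DistCp copy.
MSCK REPAIR TABLE db1.events;
```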
09-19-2021
06:03 AM
@Kiddo You either need to compute the md5 of all the records, or you can compute it over the concatenated column names of a table. For example: select md5(concat(col1,col2)) from table1; or, for the column names: select md5(concat_ws(',', collect_list(column_name))) from information_schema.columns where table_schema='db1' and table_name='table1'; Hope this answers your question. A sketch of both approaches follows below.
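A minimal sketch of both approaches, assuming a hypothetical table db1.table1 with columns col1 and col2 (the names are illustrative only):

```sql
-- Row-level checksum: md5 of the concatenated column values for each record.
-- concat_ws with a delimiter avoids one NULL column nulling out the whole hash input.
SELECT md5(concat_ws('|', CAST(col1 AS STRING), CAST(col2 AS STRING))) AS row_hash
FROM db1.table1;

-- Schema-level checksum: md5 over the concatenated column names of the table
-- (needs the information_schema database, available in Hive 3 / CDP).
-- sort_array makes the result independent of the order collect_list returns.
SELECT md5(concat_ws(',', sort_array(collect_list(column_name)))) AS schema_hash
FROM information_schema.columns
WHERE table_schema = 'db1'
  AND table_name   = 'table1';
```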
09-17-2021
06:56 AM
Hi @manojamr I am glad to hear that your original issue got resolved. As per your last comment, your query took 9.5 hours to complete. We need to check whether there is a delay somewhere, a resource crunch, or whether this runtime is normal. To figure that out we would need the beeline console output, the QueryId, the application log, and all HS2 and HMS logs. It would be great if you could create a case with Cloudera; we would be happy to assist you there. If you are happy with the reply, mark it as 'Accept as Solution'.
09-01-2021
11:44 PM
@Eric_B Yes, your understanding is correct.
09-01-2021
10:28 PM
1 Kudo
Hi @saikat As I understand it, you are running a merge query and it is failing with a java.lang.OutOfMemoryError.

Step 1: Run a major compaction on all the tables involved in the merge query (only if they are ACID tables; otherwise skip this step). Once the major compaction is triggered, make sure it completes by running "show compactions;" in beeline. This reduces the stats-collection burden on Hive. To run a major compaction: Alter table <table name> compact 'MAJOR';

Step 2: Once step 1 is done, set the following properties at the beeline session level and re-run the merge query: set hive.tez.container.size=16384; set hive.tez.java.opts=-Xmx13107m; set tez.runtime.io.sort.mb=4096; set tez.task.resource.memory.mb=16384; set tez.am.resource.memory.mb=16384; set tez.am.launch.cmd-opts=-Xmx13107m; set hive.auto.convert.join=false; A full session sketch is shown below.

The Tez container and AM sizes are set to 16 GB here. If the query still fails, you can increase them to 20 GB (hive.tez.java.opts and tez.am.launch.cmd-opts then need to be set to about 80% of the container and AM size, i.e. 16384m). If the query succeeds with 16 GB, you can try decreasing the sizes to 14, 12, or 10 GB to find the threshold between success and failure; that way you can save resources. If you are happy with the comment, mark it "Accept as Solution".
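A minimal beeline session sketch of the two steps above, assuming the merge touches hypothetical ACID tables db1.target_tbl and db1.source_tbl (the table names are illustrative):

```sql
-- Step 1: major compaction on the ACID tables involved in the merge.
ALTER TABLE db1.target_tbl COMPACT 'MAJOR';
ALTER TABLE db1.source_tbl COMPACT 'MAJOR';
SHOW COMPACTIONS;   -- re-run until the requested compactions report as succeeded

-- Step 2: session-level memory settings (16 GB containers/AM, heap at ~80%),
-- then re-run the merge query in the same session.
SET hive.tez.container.size=16384;
SET hive.tez.java.opts=-Xmx13107m;
SET tez.runtime.io.sort.mb=4096;
SET tez.task.resource.memory.mb=16384;
SET tez.am.resource.memory.mb=16384;
SET tez.am.launch.cmd-opts=-Xmx13107m;
SET hive.auto.convert.join=false;
```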
08-22-2021
12:09 AM
Hi @Nil_kharat If the issue is still not resolved, you may need to check the HS2 logs and the application logs to figure out the slowness. Regarding your other question, how to track the jobs run by each user:
1. Go to the RM UI > Running/Finished/Killed > check the User column.
2. In CM > YARN > Applications, you can search by user.
If you are happy with the response, mark it as 'Accept as Solution'.
08-19-2021
12:59 AM
Hi @Nil_kharat Generally, the kinds of issues you may see in Hive are query slowness, query failures, configuration issues, alerts, services going down, vulnerability issues, and the occasional bug.
08-14-2021
07:41 PM
Hi @Nil_kharat Yes, you can also use the lsof command to figure out the number of established connections to a particular port.
08-13-2021
04:50 AM
1 Kudo
Hi @amitshanker Thanks for the update. I can see that you have set hive.server2.webui.use.spnego to true, which means Kerberos is enabled in your cluster. If SPNEGO and Kerberos are enabled, then a few settings need to be changed in the browser. Could you please follow the link below: https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/cdh_sg_browser_access_kerberos_protected_url.html
08-12-2021
06:23 AM
Thanks, it worked.