About michaelli

cjervis · ‎05-23-2023

@suhaa Just sent you a private message to discuss

asish · ‎06-22-2022

@Orcs Hive aux JARs path In CDP Private Cloud Base, click Cloudera Manager > Clusters and select the HIVE. Click Configuration and search for Hive Auxiliary JARs Directory. Specify a directory value for the Hive Aux JARs property if necessary, or make a note of the path. Upload the JAR to the specified directory on all Hive metastore instances. Click Cloudera Manager > Clusters and select the HIVE-ON-TEZ. Click Configuration and search for Hive Auxiliary JARs Directory. Upload the JAR to the specified directory on all HiveServer instances.

robbiez · ‎08-19-2021

This is a bug fixed by HIVE-15642. You can work around this issue by setting hive.metastore.dml.events to false.

michaelli · ‎06-09-2021

A final update on solution 3: 1. when you use beeline to connect to a hiveserver2 instance (let's name this hiveserver2 instance hiveserver2-instance1) and submit statement like "select udfTest(20210101) from testTableA", if the udf itself contains java codes to connect to the same hiveserver2 instance hiveserver2-instance1 and executes any statement, it will make the hiveserver2-instance1 not function properly; 2. when you use beeline to connect to a hiveserver2 instance( let's name this hiveserver2 instance hiveserver2-instance1) and submit statement like "select udfTest(20210101) from testTableA", if the udf itself contains java codes to connect to the another hiveserver2 instance like hiveserver2-instance2 and executes any statement, then both hiveserver2-instance1 and hiveserver2-instance2 will function properly; 3. when you use hive service --cli to submit statement like "select udfTest(20210101) from testTableA", and the udf calls other hiveserver2 instances like hiveserver2-instance1, then both hive cli and the hiveserver2 instance will funciton properly; 4. when you use beeline to connect to a hiveserver2 instance( let's name this hiveserver2 instance hiveserver2-instance1) and submit statement like "select udfTest(user_code) from testTableA", if the udf itself contains java codes to connect to the same hiveserver2 instance like hiveserver2-instance1 and executes any statement, then hiveserver2-instance1 will function properly. The root cause is whether you are using the same hiveserver2 instance as both hive sql client and hive sql server: this is not the case for scenario 2 and 3, where you use different hiveserver2 or hive service --cli; this is also not the case for scenario 4: in this case a mr/tez/spark job is generated and scheduled to run in a yarn container, which acts as the sql client and connects back to the hiveserver2 to submit sqls; but this is the case for scenario 1, as when hiveserver2 analyze and compile the sql statement "select udfTest(20210101) from testTableA" , it finds that no map task need be generated (as we are using constant 20210101 here, no table records need be fetched), so as part of the analyze and compile process, it connects to itself and tries to execute the sql call itself, which makes it both the sql client and sql server. So to sum up, use sql calls against hiveserver2 inside udf is not a goot practice.

michaelli · ‎12-11-2020

The titled SocketTimeoutException occurs when the thrift-client in hiveConnection object is in the process of actively reading sql results from hive server2 (thrift server), and is not able to receive anything until the TSocket's time out occurs. You can check the source code from:HiveConnection.setupLoginTimeout and HiveAuthFactory.getSocketTransport. So you need to either tuning hiveserver2, or increase the TSocket's timeout setting. And for now, the only way to increase Tsocket's time out setting is via: DriverManager.setLoginTimeout() you can check below jira for more information: https://issues.apache.org/jira/browse/HIVE-22196 https://issues.apache.org/jira/browse/HIVE-6679 https://issues.apache.org/jira/browse/HIVE-12371

paras · ‎05-09-2020

@michaelli You can use the CM API to fetch the service configurations by executing below command from terminal # curl -u admin_uname:admin_pass "http://cm_server_host:7180/api/v19/cm/deployment" The password should be present under the name : hive_metastore_database_password Refer: https://docs.cloudera.com/documentation/enterprise/5-14-x/topics/cm_intro_api.html#concept_dnn_cr5_mr__section_ztf_lyt_xr Hope this helps, Paras Was your question answered? Make sure to mark the answer as the accepted solution. If you find a reply useful, say thanks by clicking on the thumbs up button.

Online	Offline
Last Visited	‎06-05-2023 02:56 AM

Member Since	‎05-08-2020 09:35 PM
Last Visited	‎06-05-2023 02:56 AM
Posts	11

Cloudera Community

Re: possible ways to get configuration data in hiv...

Re: possible ways to get configuration data in hiv...

Re: not able to change my company name in my profi...

Re: java.lang.NoClassDefFoundError: org/antlr/runt...

Re: unable to insert overwrite partitioned hive ta...

Re: possible ways to get configuration data in hiv...

Re: Frequent "Read timed out" error from Hive, usi...

Re: how to check hive meta store db's password?