Member since: 10-28-2020
Posts: 504
Kudos Received: 31
Solutions: 37
My Accepted Solutions
| Title | Views | Posted |
| --- | --- | --- |
| | 187 | 05-05-2024 01:27 PM |
| | 111 | 05-05-2024 01:09 PM |
| | 168 | 03-28-2024 09:51 AM |
| | 265 | 03-20-2024 03:54 AM |
| | 472 | 03-14-2024 06:29 AM |
03-17-2024
09:40 AM
@Hadoop16 Was it working before? Did anything change from a Kerberos point of view? Try regenerating the Hive keytab file and see if it helps.
03-14-2024
11:58 PM
1 Kudo
Hive does use stats from an external table when preparing a query plan. When the stats are accurate, it can estimate the size of intermediate data sets and select efficient join strategies. The only issue I noticed is that the fetch task does not work.
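For reference, stats can be gathered explicitly so the planner has something accurate to work with; a minimal sketch, with a hypothetical table name:

```sql
-- Gather table-level and column-level statistics for the optimizer
-- (table name is hypothetical).
ANALYZE TABLE my_external_table COMPUTE STATISTICS;
ANALYZE TABLE my_external_table COMPUTE STATISTICS FOR COLUMNS;
```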
03-14-2024
06:29 AM
Try this:

```sql
CREATE EXTERNAL TABLE mytable (
  col1 INT,
  col2 STRING,
  col3 STRING,
  col4 STRING,
  col5 INT,
  col6 STRING,
  col7 STRING
  ...
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
  "separatorChar" = ",",
  "quoteChar" = "\""
)
STORED AS TEXTFILE;
```

It should work.
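One caveat worth noting: OpenCSVSerde exposes every column as STRING at read time, regardless of the declared types, so numeric columns may need an explicit cast in queries. A sketch, reusing the table and column names from above:

```sql
-- OpenCSVSerde returns all columns as STRING; cast where numbers are needed.
SELECT CAST(col1 AS INT) AS col1, col2
FROM mytable
WHERE CAST(col5 AS INT) > 100;
```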
03-12-2024
11:56 PM
1 Kudo
@yashwanth It seems like you want to split columns on the position of the commas. In that case, you can create the table as follows:

```sql
CREATE TABLE my_table (
  col1 STRING,
  col2 INT,
  ...
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ',' ...
```
03-12-2024
07:46 AM
@Leopold It is disabled for external tables because the data in HDFS can change without Hive knowing about it. Unfortunately, I do not see a way to force a fetch task for a query with an aggregate function.
03-12-2024
04:00 AM
1 Kudo
@Leopold I just checked, and your observation is correct. For external tables, Hive does not use a fetch task. In the logs, I see the following message:

```
2024-03-12 10:48:37,247 INFO org.apache.hadoop.hive.ql.optimizer.StatsOptimizer: [b226e7aa-9a42-4af3-b99b-be4a6592fb7f HiveServer2-Handler-Pool: Thread-31145]: Table t7 is external. Skip StatsOptimizer.
```

However, enabling "hive.fetch.task.aggr=true" will help avoid the Reducer phase used for final aggregation; the query becomes a Map-only job.
03-12-2024
02:38 AM
@Leopold Provided column stats are available, Hive can use a fetch task to perform a simple aggregation such as max() instead of launching a Map job. Try hive.fetch.task.aggr=true; this property is disabled by default.
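A quick way to try this from a Beeline session; a minimal sketch, with hypothetical table and column names:

```sql
-- Enable fetch-task aggregation for simple aggregates.
SET hive.fetch.task.aggr=true;
-- With column stats available, a simple aggregate like this may be
-- answered without launching a full job.
SELECT MAX(col1) FROM my_table;
```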
02-26-2024
05:21 AM
@frbelotto It does not require any installation; you just need to provide the path to the driver JAR file.
02-24-2024
05:31 PM
1 Kudo
My bad. If you are using the Cloudera JDBC JAR, the driver class should be com.cloudera.hive.jdbc.HS2Driver. Since we are talking about Kerberos authentication, you should obtain a Kerberos ticket on the client machine first, and use a jdbc_url as follows:

```python
jar_file = '/path/to/hive-jdbc.jar'
jdbc_url = 'jdbc:hive2://{server}:{port}/default;principal={principal}'

# Connect to Hive (empty user/password; authentication comes from the Kerberos ticket)
conn = jaydebeapi.connect('com.cloudera.hive.jdbc.HS2Driver', jdbc_url, ['', ''], jar_file)
cursor = conn.cursor()
```
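Filling in the {server}, {port}, and {principal} placeholders is plain string formatting; a minimal sketch, where the host, port, and principal values are made up for illustration:

```python
def build_kerberos_jdbc_url(server: str, port: int, principal: str,
                            database: str = "default") -> str:
    """Build a HiveServer2 JDBC URL for Kerberos authentication."""
    return f"jdbc:hive2://{server}:{port}/{database};principal={principal}"

# Example with made-up values:
url = build_kerberos_jdbc_url("hs2.example.com", 10000,
                              "hive/hs2.example.com@EXAMPLE.COM")
print(url)
# → jdbc:hive2://hs2.example.com:10000/default;principal=hive/hs2.example.com@EXAMPLE.COM
```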
02-18-2024
09:59 AM
1 Kudo
@frbelotto I have not tried pyhive; I think it requires additional modules if you want to connect using a ZooKeeper quorum. But you could use the jaydebeapi Python module to connect to Hive 3. It works with any type of connection string, Knox or ZK. You will need the Hive JDBC driver, which you can download from here. An example of how to use the jaydebeapi module to connect to Hive:

```python
import jaydebeapi

# Connection parameters
jdbc_url = 'jdbc:hive2://knox.host:8443/default'  # JDBC URL for HiveServer2
username = 'your-username'
password = 'your-password'
jar_file = '/path/to/hive-jdbc-driver.jar'  # Path to the Hive JDBC driver JAR file

# Establish connection to Hive
conn = jaydebeapi.connect(
    'org.apache.hive.jdbc.HiveDriver',
    jdbc_url,
    [username, password],
    jar_file
)

# Create cursor
cursor = conn.cursor()

# Execute Hive query
cursor.execute('SELECT * FROM hive_table')

# Fetch results
result = cursor.fetchall()

# Close cursor and connection
cursor.close()
conn.close()
```
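The manual close() calls at the end can be made exception-safe with contextlib.closing, so the cursor and connection are released even if the query raises. A sketch of the pattern using stand-in objects (the real conn and cursor would come from jaydebeapi.connect as above):

```python
from contextlib import closing

class FakeCursor:
    """Stand-in for a DB-API cursor, used only to illustrate the pattern."""
    def __init__(self):
        self.closed = False
    def execute(self, sql):
        self.rows = [("row1",), ("row2",)]
    def fetchall(self):
        return self.rows
    def close(self):
        self.closed = True

class FakeConnection:
    """Stand-in for a jaydebeapi connection."""
    def __init__(self):
        self.closed = False
    def cursor(self):
        return FakeCursor()
    def close(self):
        self.closed = True

def run_query(conn, sql):
    # closing() guarantees close() runs even if execute/fetchall raises.
    with closing(conn) as c, closing(c.cursor()) as cur:
        cur.execute(sql)
        return cur.fetchall()

result = run_query(FakeConnection(), 'SELECT * FROM hive_table')
print(result)  # [('row1',), ('row2',)]
```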