Created 02-16-2024 09:16 PM
Hello guys,
I currently work on a company that does provide hive 3.1 servers by using knox or zookeeper (kerberos) authentication methods. I am able to connect and query data by using a odbc connection on my personal computer.
Now I need to set the connection on a virtual jupiter notebook server with pyodbc , so, I am not able to install the ODBC (and probably the server is based on Linux anyway).
I´ve seen many options as pyhive, implya and others, but everything I find is regarding Hive 2 servers.
So, what could I use to set the python connection to the Hive 3 servers?
Created 02-18-2024 09:59 AM
@frbelotto I have not tried with pyhive, I think it requires additional modules if you want to connect using zookeeper quorum. But you could use jaydebeapi python module to connect to Hive3. It works for any type of connection string knox/ZK. You would require Hive driver that you could download from here.
An example on how to to make use of jaydebeapi module to connect to Hive:
import jaydebeapi
# Connection parameters
jdbc_url = 'jdbc:hive2://knox.host:8443/default' # JDBC URL for HiveServer2
username = 'your-username'
password = 'your-password'
jar_file = '/path/to/hive-jdbc-driver.jar' # Path to the Hive JDBC driver JAR file
# Establish connection to Hive
conn = jaydebeapi.connect(
'org.apache.hive.jdbc.HiveDriver',
jdbc_url,
[username, password],
jar_file
)
# Create cursor
cursor = conn.cursor()
# Execute Hive query
cursor.execute('SELECT * FROM hive_table')
# Fetch results
result = cursor.fetchall()
# Close cursor and connection
cursor.close()
conn.close()
Created 04-01-2024 02:36 AM
@frbelotto, @ZainK Did the response assist in resolving your query? If it did, kindly mark the relevant reply as the solution, as it will aid others in locating the answer more easily in the future.
Regards,
Vidya Sargur,