Member since
11-17-2018
11-19-2018
02:54 AM
As mentioned in my question, I have already tried using PyHive.
11-17-2018
06:17 PM
Hello, I have looked all over the internet and couldn't find a clean solution.

Scenario: I have HiveServer2 running in my company's QA environment. I can ssh into the environment and query tables in HDFS using Hive. I inspected hive-site.xml in QA and found that it uses Kerberos authentication. I would now like to run queries programmatically using Python from my local machine for experimentation. I have come across PyHive and Beeline.

My biggest hurdle is that I cannot find the JDBC URL or IP address of HiveServer2. Where can I get this information? I used the IP address of my QA environment (from ifconfig) in my Python script, and it couldn't connect to HiveServer2. Is there any other way to access the tables in HDFS from Python?
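To make the question concrete, here is a minimal sketch of how the connection details might be derived from hive-site.xml. It assumes the file sets the standard properties hive.server2.thrift.port, hive.server2.authentication and hive.server2.authentication.kerberos.principal; the sample XML, the hostname hs2.example.internal and the realm EXAMPLE.COM are made-up placeholders, not values from my environment. With Kerberos, the fully qualified hostname is usually needed rather than a raw IP address, and a valid ticket (kinit) has to exist on the client before connecting.

```python
import xml.etree.ElementTree as ET

# Hypothetical hive-site.xml content; real values will differ per cluster.
SAMPLE_HIVE_SITE = """<configuration>
  <property>
    <name>hive.server2.thrift.port</name>
    <value>10000</value>
  </property>
  <property>
    <name>hive.server2.authentication</name>
    <value>KERBEROS</value>
  </property>
  <property>
    <name>hive.server2.authentication.kerberos.principal</name>
    <value>hive/_HOST@EXAMPLE.COM</value>
  </property>
</configuration>"""


def read_hive_site(xml_text):
    """Return a dict mapping property names to values from hive-site.xml text."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


def build_jdbc_url(props, host, database="default"):
    """Assemble a HiveServer2 JDBC URL (binary transport) from the properties."""
    # 10000 is the default Thrift port when the property is not set.
    port = props.get("hive.server2.thrift.port", "10000")
    url = f"jdbc:hive2://{host}:{port}/{database}"
    if props.get("hive.server2.authentication", "").upper() == "KERBEROS":
        principal = props.get("hive.server2.authentication.kerberos.principal")
        if principal:
            url += f";principal={principal}"
    return url


def connect_with_pyhive(props, host, database="default"):
    """Open a PyHive connection; requires pyhive, sasl and a Kerberos ticket."""
    from pyhive import hive  # imported lazily so the helpers above run without it

    # The service name is the part of the principal before the first slash.
    principal = props.get("hive.server2.authentication.kerberos.principal", "hive/")
    return hive.Connection(
        host=host,
        port=int(props.get("hive.server2.thrift.port", "10000")),
        database=database,
        auth="KERBEROS",
        kerberos_service_name=principal.split("/")[0],
    )


if __name__ == "__main__":
    props = read_hive_site(SAMPLE_HIVE_SITE)
    print(build_jdbc_url(props, "hs2.example.internal"))
```

The resulting URL is the form Beeline accepts with `-u`, and connect_with_pyhive passes the same host and port to PyHive. If the cluster uses HTTP transport or ZooKeeper service discovery, the URL takes a different shape and this sketch would not apply as written.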
Labels: Apache Hadoop