Member since
06-14-2017
4
Posts
0
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
24710 | 06-20-2017 07:07 AM |
06-20-2017
02:23 PM
We (I should say, IT-team) find a solution Upgrade of python packages thrift (to version 0.10.0) and
PyHive (to version 0.3.0) don’t know why the version we used wasn’t the latest. Added the following: <property>
<name>hive.server2.authentication</name>
<value>NOSASL</value></property> in the hive config from pyhive import hive
conn = hive.Connection(host="myserver",
auth='NOSASL')import pandas as pd
import sys
df = pd.read_sql("SELECT * FROM my_table",
conn)print(sys.getsizeof(df))
df.head() worked without problem then. Best, Thomas
... View more
06-20-2017
07:07 AM
Hi, thanks for the answer. We (I should say, IT-team) find a solution Upgrade of python packages thrift (to version 0.10.0) and PyHive (to version 0.3.0) don’t know why the version we used wasn’t the latest. Added the following: <property>
<name>hive.server2.authentication</name>
<value>NOSASL</value>
</property> To the following Hive config parameters in Cloudera Manager: HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml necessary so that HUE would work from pyhive import hive
conn = hive.Connection(host="myserver", auth='NOSASL')
import pandas as pd
import sys
df = pd.read_sql("SELECT * FROM m_ytable", conn)
print(sys.getsizeof(df))
df.head() worked without problem/error. Best, Tom
... View more
06-14-2017
11:48 PM
Hi, I'm a Hadoop newbie so don't shoot me yet. I'm trying to import hive tables with python as described how to access hive via python Namely I'm working on a BDA cloudera under red hat 4.4.7 with GCC 4.4.7 with anaconda-python 3.5.2 installed on a single node and python 2.6.6 system wide. The following packages are installed using anaconda (python 3.5.2 therefore): - cyrus-sasl-devel - python-devel - pyhive when I used the sample code (complete code and error message : stackoverflow post) from pyhive import hive conn = hive.Connection(host="myserver", port = 10000) it throws: "Could not start sasl" I digged in forums, googlized a lot but I didn' find a fix for this issue (tried to uninstall reinstall different version of the sasl package, tried pyhs2 but still rely on sasl) Have you any idea ? your help will be greatly appreciated ! Thanks, Tom
... View more
Labels:
- Labels:
-
Apache Hive
06-13-2017
03:06 PM
Hi, I'm a Hadoop newbie, so don't shoot me yet. I tried to set a hive connection as described here query-hive-using-python.html I want to set a hive connection using the hive.Connection with python 3.5.2 but the SASL package seems to cause a problem. I saw on a forum that SASL is compatible only with 2.7 python. Is that right ? Thank you in advance !
Tom from pyhive import hive#conn = hive.Connection(host="myserver", port=10000)conn = hive.Connection(host="myserver")import pandas as pd
TTransportException Traceback (most recent call last)in ()1 from pyhive import hive2 #conn = hive.Connection(host="myserver", port=10000)----> 3 conn = hive.Connection(host="myserver") import pandas as pd/opt/anaconda3/lib/python3.5/site-packages/pyhive/hive.py in init(self, host, port, username, database, auth, configuration)102103 try:--> 104 self._transport.open()105 open_session_req = ttypes.TOpenSessionReq(106 client_protocol=protocol_version,/opt/anaconda3/lib/python3.5/site-packages/thrift_sasl/init.py in open(self)70 if not ret:71 raise TTransportException(type=TTransportException.NOT_OPEN,---> 72 message=("Could not start SASL: %s" % self.sasl.getError()))7374 # Send initial responseTTransportException: TTransportException(message="Could not start SASL: b'Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found'", type=1)
... View more
Labels:
- Labels:
-
Apache Hive