Impala and unicode (displaying issue)
Labels: Apache Impala
Created on 11-26-2017 07:49 AM - edited 09-16-2022 05:34 AM
Hi, all!
I'm using impyla (a Python library) to insert data into Impala, and all the strings are Unicode.
All the table columns are of type STRING.
But I can't see the Unicode characters when I run a SELECT statement from a client (using the JDBC or ODBC driver).
How can I handle it?
Thanks
Created 11-27-2017 04:41 PM
Impala does not fully support Unicode. See "Character Sets" here:
https://www.cloudera.com/documentation/enterprise/latest/topics/impala_string.html
That said, reading/writing UTF-8 should work in most circumstances. Is your data in UTF-8 or a different representation of Unicode? Other encodings will likely not work at all.
Do queries using the Impala Shell return expected results?
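One way to answer the encoding question before involving Impala at all is to inspect the raw bytes on the Python side. The following is a minimal sketch in Python 3; the sample string and the helper name `is_valid_utf8` are made up for illustration and are not part of impyla or Impala.

```python
# Sketch: check whether raw bytes are valid UTF-8 and how byte length
# differs from character count. The sample text is hypothetical.
text = "Привет, Impala"

utf8_bytes = text.encode("utf-8")
utf16_bytes = text.encode("utf-16")  # a different Unicode representation


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(is_valid_utf8(utf8_bytes))    # True
print(is_valid_utf8(utf16_bytes))   # False (the UTF-16 BOM is not valid UTF-8)

# Non-ASCII characters take more than one byte in UTF-8, so byte-oriented
# string functions can report a different length than the character count.
print(len(text), len(utf8_bytes))   # 14 20

# Decoding UTF-8 bytes with the wrong codec does not fail, it silently
# produces garbled text, which is what a misconfigured client would show.
mojibake = utf8_bytes.decode("latin-1")
print(mojibake != text)             # True
```

If the bytes that reach Impala pass a check like this, the remaining suspects are the client or driver configuration and the way the query text is built.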
Created 12-04-2017 02:52 AM
I read everything related to data types in Impala, but in the end I found that the problem was in how the SQL query was constructed on the Python side.
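The exact change is not described in this thread. As a purely hypothetical illustration of how query construction on the Python side can corrupt Unicode, here is a small Python 3 sketch; the table name `t`, the column `name`, and the sample value are assumptions, and the commented-out `cursor.execute` call assumes the pyformat parameter style of a DB-API cursor such as the one impyla provides.

```python
# Hypothetical sample value; not taken from the original post.
value = "Привет"

# Pitfall: interpolating encoded bytes into the SQL text. In Python 3 this
# embeds the repr of the bytes object (b'\xd0...') instead of the characters.
bad_sql = "INSERT INTO t (name) VALUES ('%s')" % value.encode("utf-8")
print("b'" in bad_sql)   # True: the literal now contains b'...'

# Safer pattern: keep the value as str and pass it separately, letting the
# driver handle quoting and encoding.
sql = "INSERT INTO t (name) VALUES (%(name)s)"
params = {"name": value}
# cursor.execute(sql, params)  # assumed DB-API call; needs a live connection
```

Keeping values out of the query string also avoids quoting problems when a value itself contains a single quote.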
Created 12-04-2017 09:03 AM
Thanks for following up! Do you mind summarizing the changes you made to fix it so others can benefit from your solution?