Impala and Unicode (display issue)

Hi, all!

I'm using impyla (a Python library) to insert data into Impala, and all the strings are Unicode.

All the table fields are of type STRING.

But I can't see the Unicode characters when I run a SELECT statement from a client (using the JDBC or ODBC driver).

How can I handle it?

Thanks

1 ACCEPTED SOLUTION

I've read everything related to data types in Impala.

But in the end, I found that the problem was in how the SQL query was constructed on the Python side.
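
The post does not say what exactly was wrong with the query construction. One common way this kind of problem arises in Python 3 is formatting an encoded bytes value into the SQL text, which embeds the bytes literal instead of the characters. The sketch below is only an illustration of that pitfall, not the poster's actual fix. `RecordingCursor`, the `greetings` table, and the `message` column are hypothetical names, and the sketch assumes the driver accepts DB-API `%(name)s` placeholders passed to `cursor.execute(operation, parameters)`.

```python
class RecordingCursor:
    """Hypothetical stand-in for a DB-API cursor; it only records calls."""

    def __init__(self):
        self.calls = []

    def execute(self, operation, parameters=None):
        self.calls.append((operation, parameters))


text = "Привет"  # made-up sample value, not data from this thread

# Pitfall: formatting encoded bytes into the SQL text embeds the bytes
# literal (b'\xd0...') instead of the characters themselves.
bad_sql = "INSERT INTO greetings (message) VALUES ('%s')" % text.encode("utf-8")

# Safer: keep the value as str and pass it separately from the SQL text,
# leaving quoting and escaping to the driver.
cursor = RecordingCursor()
cursor.execute(
    "INSERT INTO greetings (message) VALUES (%(message)s)",
    {"message": text},
)

print(bad_sql)
print(cursor.calls[0])
```

With a real connection, the same `execute` call would go to a cursor obtained from the driver rather than to the recording stub.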

3 REPLIES

Impala does not fully support Unicode. See "Character Sets" here:

https://www.cloudera.com/documentation/enterprise/latest/topics/impala_string.html


That said, reading/writing UTF-8 should work in most circumstances. Is your data in UTF-8 or a different representation of Unicode? Other encodings will likely not work at all.
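
To check which representation the data is in, a quick test on the Python side is to see whether the raw bytes decode cleanly as UTF-8. This is a minimal sketch with a made-up sample string; `is_valid_utf8` is a hypothetical helper, and a clean decode is only a hint, since some non-UTF-8 byte sequences also happen to decode without error.

```python
def is_valid_utf8(raw: bytes) -> bool:
    """Return True if `raw` decodes as UTF-8 without errors."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


sample = "Привет, Impala"  # made-up sample value, not data from this thread

utf8_bytes = sample.encode("utf-8")
cp1251_bytes = sample.encode("cp1251")  # a legacy single-byte Cyrillic encoding

print(is_valid_utf8(utf8_bytes))
print(is_valid_utf8(cp1251_bytes))

# Character count versus UTF-8 byte count; these differ for non-ASCII text.
print(len(sample), len(utf8_bytes))
```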


Do queries using the Impala Shell return expected results?


Thanks for following up! Do you mind summarizing the changes you made to fix it so others can benefit from your solution?