New Contributor
Posts: 1
Registered: 02-06-2018

Case and accent insensitive ?

Hi there,


My issue is very basic: I'm looking for the Hive equivalent of this SQL collation clause:

COLLATE SQL_Latin1_General_CP1_CI_AI

This is very important for me because I work with town names such as Sao Paulo, which are written in several different ways. I posted an earlier question on this subject on Stack Overflow, which unfortunately didn't get any useful answers.


Could you help me, please?


Best regards,


Cloudera Employee
Posts: 121
Registered: 11-20-2015

Re: Case and accent insensitive ?

You can set the character set Hive uses for a given table with the table SerDe property "serialization.encoding". Take a look at the following JIRA for an example of how to use it:


If you would like to use the MultiDelimitSerDe class, referenced in HIVE-12653, this serialization feature is available starting in CDH 5.10.
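
As a rough sketch of setting the property described above, assuming a hypothetical existing table named `towns` whose underlying files are encoded in ISO-8859-1 (both the table name and the encoding are placeholders for illustration, not taken from your setup):

```sql
-- Hypothetical table name; replace with your own table.
-- Tells the table's SerDe to decode the underlying files as ISO-8859-1.
ALTER TABLE towns
SET SERDEPROPERTIES ('serialization.encoding' = 'ISO-8859-1');

-- Check that the property was recorded on the table.
DESCRIBE FORMATTED towns;
```

Note that this only controls how the stored bytes are decoded when the table is read; it does not by itself make string comparisons case- or accent-insensitive.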



The valid character sets are discussed at the following link. In particular, take a look at the "Standard charsets" section.