New Contributor
Posts: 1
Registered: ‎02-06-2018

Case and accent insensitive ?

Hi there,

 

My issue is very basic: I'm looking for the Hive equivalent of this SQL Server collation:

COLLATE SQL_Latin1_General_CP1_CI_AI

It's very important for me because I work with town names like Sao Paulo, which are written in several different ways. I posted an earlier question about this on Stack Overflow, which unfortunately didn't get any useful answers.

 

Could you help me, please?

 

Best regards,

Nicolas.

Cloudera Employee
Posts: 31
Registered: ‎11-20-2015

Re: Case and accent insensitive ?

You can set the character set that Hive uses for a given table with the table SerDe property "serialization.encoding". Take a look at the following JIRA for an example of how to use it:

 

https://issues.apache.org/jira/browse/HIVE-12653
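
To show why the encoding matters for names such as São Paulo, here is a minimal Java sketch. It is not Hive code, and the class and method names are hypothetical. It decodes the same Latin-1 bytes with two different charsets. It only assumes that a reader configured with the wrong encoding behaves in a comparable way; it does not reproduce Hive's own SerDe logic.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, for illustration only.
class EncodingDemo {
    // Decode raw bytes with the given charset.
    static String decode(byte[] raw, Charset charset) {
        return new String(raw, charset);
    }

    public static void main(String[] args) {
        // Sample data (an assumption): "São Paulo" as stored in a Latin-1 file.
        byte[] latin1Bytes = "S\u00e3o Paulo".getBytes(StandardCharsets.ISO_8859_1);

        // Decoding with the matching charset recovers the original text.
        System.out.println(decode(latin1Bytes, StandardCharsets.ISO_8859_1));

        // Decoding as UTF-8 fails on the byte for "ã", which is not valid
        // UTF-8 in this position, so it is replaced with U+FFFD.
        System.out.println(decode(latin1Bytes, StandardCharsets.UTF_8));
    }
}
```

If the data looks garbled in the second way, the table's "serialization.encoding" probably does not match the encoding of the underlying files.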

 

If you would like to use the MultiDelimitSerDe class, referenced in HIVE-12653, this serialization feature is available starting in CDH 5.10.

 

https://www.cloudera.com/documentation/enterprise/release-notes/topics/cdh_rn_fixed_in_510.html

 

 

The valid character sets are described at the following link. In particular, take a look at the "Standard charsets" section.

 

https://docs.oracle.com/javase/7/docs/api/java/nio/charset/Charset.html
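
Before putting a charset name into "serialization.encoding", you can check whether the JVM recognizes it. The sketch below is a minimal example using the standard `Charset.isSupported` call. The class name, helper method and the extra test name are hypothetical. It also assumes that the JVM you run it on has the same charsets as the one Hive uses.

```java
import java.nio.charset.Charset;

// Hypothetical helper class, for illustration only.
class CharsetCheck {
    // The six charsets the Java documentation lists as "Standard charsets".
    static final String[] STANDARD = {
        "US-ASCII", "ISO-8859-1", "UTF-8", "UTF-16BE", "UTF-16LE", "UTF-16"
    };

    // Returns true if this JVM supports the named charset.
    // Charset.isSupported throws IllegalCharsetNameException (a subclass of
    // IllegalArgumentException) for syntactically illegal names, so those
    // are treated as "not usable" here.
    static boolean isUsable(String name) {
        try {
            return Charset.isSupported(name);
        } catch (IllegalArgumentException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        for (String name : STANDARD) {
            System.out.println(name + " usable: " + isUsable(name));
        }
        // A legal but made-up name, to show the negative case.
        System.out.println("x-made-up-charset usable: " + isUsable("x-made-up-charset"));
    }
}
```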

 
