<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Case-insensitive comparison Hive in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345292#M234473</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/98478"&gt;@DataMan-HJ&lt;/a&gt;, case-insensitive joins do not seem to be supported in Hive and likely will not be implemented, because Hive relies on Java's UTF-8 strings and the comparison behavior that implicitly comes with them, with no way to change the collation.&lt;/P&gt;&lt;P&gt;There's a good discussion on&amp;nbsp;&lt;A href="https://issues.apache.org/jira/browse/HIVE-4070" target="_self"&gt;HIVE-4070&lt;/A&gt;, where a similar request is raised for the behavior of the LIKE operator. You can review the pros and cons there. You will most likely need to change the individual joins to use the lower/upper functions.&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Miklos&lt;/P&gt;</description>
    <pubDate>Thu, 09 Jun 2022 10:35:29 GMT</pubDate>
    <dc:creator>mszurap</dc:creator>
    <dc:date>2022-06-09T10:35:29Z</dc:date>
    <item>
      <title>Case-insensitive comparison Hive</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345279#M234468</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;I have a requirement to do case-insensitive joins across the system, and I would rather not apply upper/lower functions.&amp;nbsp;&lt;BR /&gt;I tried setting&amp;nbsp;TBLPROPERTIES('serialization.encoding'='utf8mb4_unicode_ci') at the table level, but the comparison is still case-sensitive.&amp;nbsp; Please see below:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;P&gt;drop table if exists test.caseI;&lt;BR /&gt;create table test.caseI&lt;BR /&gt;(name string, id int)&lt;BR /&gt;TBLPROPERTIES('serialization.encoding'='utf8mb4_unicode_ci');&lt;BR /&gt;insert into test.caseI values ('hj',1);&lt;/P&gt;
&lt;P&gt;drop table if exists test.caseI_2;&lt;BR /&gt;create table test.caseI_2&lt;BR /&gt;(name string, id int)&lt;BR /&gt;TBLPROPERTIES('serialization.encoding'='utf8mb4_unicode_ci');&lt;BR /&gt;insert into test.caseI_2 values ('HJ',1);&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;select * from test.caseI i&lt;BR /&gt;inner join test.caseI_2 i2 on i.name=i2.name;&lt;BR /&gt;-- No result&lt;BR /&gt;I also tried the encoding 'SQL_Latin1_General_CP1_CI_AI' but got the same result as above.&lt;BR /&gt;&lt;BR /&gt;Any help would be appreciated, thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 07:55:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345279#M234468</guid>
      <dc:creator>DataMan-HJ</dc:creator>
      <dc:date>2026-04-21T07:55:18Z</dc:date>
    </item>
    <item>
      <title>Re: Case-insensitive comparison Hive</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345292#M234473</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/98478"&gt;@DataMan-HJ&lt;/a&gt;, case-insensitive joins do not seem to be supported in Hive and likely will not be implemented, because Hive relies on Java's UTF-8 strings and the comparison behavior that implicitly comes with them, with no way to change the collation.&lt;/P&gt;&lt;P&gt;There's a good discussion on&amp;nbsp;&lt;A href="https://issues.apache.org/jira/browse/HIVE-4070" target="_self"&gt;HIVE-4070&lt;/A&gt;, where a similar request is raised for the behavior of the LIKE operator. You can review the pros and cons there. You will most likely need to change the individual joins to use the lower/upper functions.&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Miklos&lt;/P&gt;</description>
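      <content:encoded><![CDATA[
A minimal sketch of the lower/upper workaround, reusing the test.caseI and test.caseI_2 tables from the question. Wrapping both join keys in lower() normalises case at comparison time. As far as I know, the serialization.encoding table property only tells the SerDe which character encoding to use when reading and writing data, so it plays no part here. The view name test.caseI_norm and the column name_key are hypothetical, added only for illustration.

```sql
-- Normalise both join keys to lower case so that 'hj' and 'HJ' compare equal.
SELECT i.name  AS name_1,
       i2.name AS name_2,
       i.id
FROM test.caseI i
INNER JOIN test.caseI_2 i2
  ON lower(i.name) = lower(i2.name);

-- Optional (an assumption, not something proposed in the thread):
-- keep the normalisation in one place with a view, so individual
-- queries join on name_key instead of repeating lower() each time.
CREATE VIEW IF NOT EXISTS test.caseI_norm AS
SELECT lower(name) AS name_key,
       name,
       id
FROM test.caseI;
```

With the sample rows from the question, the first query should match the 'hj' row with the 'HJ' row. Note that lower() follows Hive's own case-mapping rules, so strings containing locale-sensitive characters may deserve a separate check.
]]></content:encoded>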
      <pubDate>Thu, 09 Jun 2022 10:35:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345292#M234473</guid>
      <dc:creator>mszurap</dc:creator>
      <dc:date>2022-06-09T10:35:29Z</dc:date>
    </item>
    <item>
      <title>Re: Case-insensitive comparison Hive</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345452#M234537</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/12885"&gt;@mszurap&lt;/a&gt;&amp;nbsp;for your quick response. &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jun 2022 07:17:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Case-insensitive-comparison-Hive/m-p/345452#M234537</guid>
      <dc:creator>DataMan-HJ</dc:creator>
      <dc:date>2022-06-13T07:17:21Z</dc:date>
    </item>
  </channel>
</rss>

