Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Whether Hive table supports UTF-8 code as field delimiters?

Highlighted

Whether Hive table supports UTF-8 code as field delimiters?

New Contributor

Note : As my client standards I have to use only c- cedilla as a delimiter .

 

I created a sqoop command with c-cedilla as a delimiters.This is successfully worked. Now my data contains fields which is separted by c-cedilla .

Next step i created the Hive table with  c-cedila as field delimiters (created c-cedilla by using octal representation "\307"of it.) , here the problem arises. whenever i am quering the table only NULL values is displayed.

Please assits on this.

 

Find the syntx below.

sqoop :

        --fields-terminated-by "\0xc7"

hive 

       ROW FORMAT DELIMITED FIELDS TERMINATED BY '\307'
        STORED AS TEXTFILE

 

 

In some communities they states "Hive is not able to use  ASCII characters , higher octal number than 177"  is it so? please confirm on this.

 

and also there is a ticket for other  character , refer the link  "https://issues.apache.org/jira/browse/HIVE-14334" .