<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame. in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104624#M46349</link>
    <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/14353/arpit3006.html"&gt;@Arpit Jain&lt;/A&gt;&lt;/P&gt;&lt;P&gt;When you create table as select ... into ORC table don't forget the cast the proper data type to match your target table. Some of the fields may get converted implicitly, others not.&lt;/P&gt;</description>
    <pubDate>Thu, 17 Nov 2016 06:30:07 GMT</pubDate>
    <dc:creator>cstanca</dc:creator>
    <dc:date>2016-11-17T06:30:07Z</dc:date>
    <item>
      <title>How to remove double quote from csv file ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104622#M46347</link>
      <description>&lt;P&gt;I m loading csv file into Hive orc table using data frame temporary table. After loading into Hive table data is present with double quote.&lt;/P&gt;
&lt;P&gt;Input file&lt;/P&gt;
&lt;P&gt;"Arpit","Jain",123&lt;/P&gt;
&lt;P&gt;"Qwee","ffhh",5778&lt;/P&gt;
&lt;P&gt;How to remove this double quote at time of inserting into Hive table which induce by csv format .&lt;/P&gt;
&lt;P&gt;,&lt;/P&gt;
&lt;P&gt;I m loading csv file to orc Hive table using data frame temporary table.&lt;/P&gt;
&lt;P&gt;But in Hive table it's loaded with double quote.&lt;/P&gt;
&lt;P&gt;How can I remove double quotes .&lt;/P&gt;
&lt;P&gt;Input csv file in hdfs&lt;/P&gt;
&lt;P&gt;"Arpit","Jain",1234,"India"&lt;/P&gt;
&lt;P&gt;"ABC","abcd",7657,"India"&lt;/P&gt;
&lt;P&gt;,&lt;/P&gt;</description>
      <pubDate>Tue, 21 Dec 2021 20:29:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104622#M46347</guid>
      <dc:creator>arpit3006</dc:creator>
      <dc:date>2021-12-21T20:29:26Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104623#M46348</link>
      <description>&lt;P&gt;You will need to use OpenCSVSerde: &lt;A href="https://cwiki.apache.org/confluence/display/Hive/CSV+Serde" target="_blank"&gt;https://cwiki.apache.org/confluence/display/Hive/CSV+Serde&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Just add this to your create table ddl (and use the appropriate delim for separator character)&lt;/P&gt;&lt;PRE&gt;ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
   "separatorChar" = ",",
   "quoteChar"     = "\""
) &lt;/PRE&gt;&lt;P&gt;A limitation is that it stores all fields as string. See link above and this one: &lt;A href="https://community.hortonworks.com/questions/56611/hive-ignoring-data-type-declarations-in-create-tab.html" target="_blank"&gt;https://community.hortonworks.com/questions/56611/hive-ignoring-data-type-declarations-in-create-tab.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;There are workarounds like loading using OpenCSVSerde into a temp table and then load that (Create table as select...) into an ORC table.&lt;/P&gt;&lt;P&gt;Alternatively, you could use pig to clean double quotes first and then load that data.&lt;/P&gt;&lt;P&gt;&lt;EM&gt;If this is what you were looking for, let me know by accepting the answer; else, let me know of any gaps.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Nov 2016 02:49:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104623#M46348</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-16T02:49:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104624#M46349</link>
      <description>&lt;P&gt;&lt;A href="https://community.hortonworks.com/users/14353/arpit3006.html"&gt;@Arpit Jain&lt;/A&gt;&lt;/P&gt;&lt;P&gt;When you create table as select ... into ORC table don't forget the cast the proper data type to match your target table. Some of the fields may get converted implicitly, others not.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2016 06:30:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104624#M46349</guid>
      <dc:creator>cstanca</dc:creator>
      <dc:date>2016-11-17T06:30:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104625#M46350</link>
      <description>&lt;P&gt;doesn't work here, full script is as below:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE&gt;CREATE TABLE sr.sr2013 ( 
creation_date STRING,   
status STRING,   
first_3_chars_of_postal_code STRING,   
intersection_street_1 STRING,   
intersection_street_2 STRING,   
ward STRING,   
service_request_type STRING,   
division STRING,   
section STRING ) 
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' 
WITH SERDEPROPERTIES (
'colelction.delim'='\u0002', 
'mapkey.delim'='\u0003', 
'serialization.format'=',', 
'field.delim'=',', 
'skip.header.line.count'='1',
'quoteChar'= "\"") ;
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;IMG src="https://community.cloudera.com/storage/attachments/93770-screenshot-12-2-2018-10-07-42-am.png" border="0" /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Dec 2021 19:16:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104625#M46350</guid>
      <dc:creator>axie</dc:creator>
      <dc:date>2021-12-21T19:16:22Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104626#M46351</link>
      <description>&lt;P&gt;
	Impala rejected the change of:
&lt;/P&gt;
&lt;P&gt;
	ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
&lt;/P&gt;</description>
      <pubDate>Sun, 02 Dec 2018 23:40:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/104626#M46351</guid>
      <dc:creator>axie</dc:creator>
      <dc:date>2018-12-02T23:40:13Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove double quote from csv file at time of loading csv file into Hive orc tabel using data frame temp table.,How to remove double.quote from column variable present in csv file..,I am loading csv file into Hive orc table using data frame.</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/332697#M46352</link>
      <description>Impala doesnt support the ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde' even in newer version like v3.4.0. Any other option to remove double quotes in the output from Impala where the input csv file has quotes?&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 21 Dec 2021 16:26:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-to-remove-double-quote-from-csv-file/m-p/332697#M46352</guid>
      <dc:creator>ebeb</dc:creator>
      <dc:date>2021-12-21T16:26:17Z</dc:date>
    </item>
  </channel>
</rss>

