<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive - Line Termination in Quotes in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368761#M240258</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/99744"&gt;@tj2007&lt;/a&gt;, It is not possible to modify the data. I have tried the "&lt;SPAN&gt;LazySimpleSerDe&lt;/SPAN&gt;" but it didn't give the correct output (Mentioned below).&lt;/P&gt;&lt;PRE&gt;&lt;SPAN&gt;ROW&lt;/SPAN&gt; &lt;SPAN&gt;FORMAT&lt;/SPAN&gt; &lt;SPAN&gt;SERDE&lt;/SPAN&gt; 
  &lt;SPAN&gt;'org.apache.hadoop.hive.serde2.OpenCSVSerde'&lt;/SPAN&gt;
&lt;SPAN&gt;with&lt;/SPAN&gt; &lt;SPAN&gt;serdeproperties&lt;/SPAN&gt;&lt;SPAN&gt; (
    &lt;/SPAN&gt;&lt;SPAN&gt;"separatorChar"&lt;/SPAN&gt; &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;","&lt;/SPAN&gt;&lt;SPAN&gt;,
    &lt;/SPAN&gt;&lt;SPAN&gt;"quoteChar"&lt;/SPAN&gt;     &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;'\""'&lt;/SPAN&gt;&lt;SPAN&gt;)      &lt;/SPAN&gt;&lt;SPAN&gt;STORED&lt;/SPAN&gt; &lt;SPAN&gt;AS&lt;/SPAN&gt; &lt;SPAN&gt;TEXTFILE&lt;/SPAN&gt;&lt;/PRE&gt;&lt;TABLE width="353"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="250"&gt;"IM43163","SOUTH,OFC","10-Jan-23"&lt;/TD&gt;&lt;TD width="78"&gt;?&lt;/TD&gt;&lt;TD width="25"&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;"IM41763","John:&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;comment added","12-Jan-23"&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I need output like this.&lt;/P&gt;&lt;TABLE width="458"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="250"&gt;IM43163&lt;/TD&gt;&lt;TD width="143"&gt;SOUTH,OFC&lt;/TD&gt;&lt;TD width="65"&gt;10-Jan-23&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;IM41763&lt;/TD&gt;&lt;TD&gt;John:comment added&lt;/TD&gt;&lt;TD&gt;12-Jan-23&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;Please also note that the input file is a CSV file that I can open successfully in Excel. Your support will be highly appreciated.&lt;/P&gt;</description>
    <pubDate>Tue, 18 Apr 2023 06:07:44 GMT</pubDate>
    <dc:creator>Abdul_</dc:creator>
    <dc:date>2023-04-18T06:07:44Z</dc:date>
    <item>
      <title>Hive - Line Termination in Quotes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368671#M240232</link>
      <description>&lt;P&gt;My data set is as below.&lt;/P&gt;&lt;P&gt;"IM43163","SOUTH,OFC","10-Jan-23"&lt;/P&gt;&lt;P&gt;"IM41763","John:&lt;/P&gt;&lt;P&gt;comment added","12-Jan-23"&lt;/P&gt;&lt;PRE&gt;&lt;SPAN&gt;CREATE&lt;/SPAN&gt; &lt;SPAN&gt;EXTERNAL&lt;/SPAN&gt; &lt;SPAN&gt;TABLE&lt;/SPAN&gt; &lt;SPAN&gt;database&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;table1&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;
  &lt;SPAN&gt;`col_1`&lt;/SPAN&gt; &lt;SPAN&gt;string&lt;/SPAN&gt;&lt;SPAN&gt;, 
  &lt;/SPAN&gt;&lt;SPAN&gt;`col_2`&lt;/SPAN&gt; &lt;SPAN&gt;string&lt;/SPAN&gt;&lt;SPAN&gt;,
  &lt;/SPAN&gt;&lt;SPAN&gt;`col_3`&lt;/SPAN&gt; &lt;SPAN&gt;string&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;SPAN&gt;ROW&lt;/SPAN&gt; &lt;SPAN&gt;FORMAT&lt;/SPAN&gt; &lt;SPAN&gt;SERDE&lt;/SPAN&gt; 
  &lt;SPAN&gt;'org.apache.hadoop.hive.serde2.OpenCSVSerde'&lt;/SPAN&gt; 
&lt;SPAN&gt;WITH&lt;/SPAN&gt; &lt;SPAN&gt;SERDEPROPERTIES&lt;/SPAN&gt;&lt;SPAN&gt; ( &lt;/SPAN&gt;&lt;SPAN&gt;"separatorChar"&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;","&lt;/SPAN&gt;&lt;SPAN&gt; , &lt;/SPAN&gt;&lt;SPAN&gt;"quoteChar"&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"\""&lt;/SPAN&gt;&lt;SPAN&gt;) &lt;/SPAN&gt;&lt;/PRE&gt;&lt;P&gt;Because the second record is split across two lines, it does not load properly and produces null values. The output I am getting is shown below.&lt;/P&gt;&lt;TABLE width="201"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="58"&gt;IM43163&lt;/TD&gt;&lt;TD width="78"&gt;SOUTH,OFC&lt;/TD&gt;&lt;TD width="65"&gt;10-Jan-23&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;IM41763&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;</description>
      <pubDate>Tue, 21 Apr 2026 07:16:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368671#M240232</guid>
      <dc:creator>Abdul_</dc:creator>
      <dc:date>2026-04-21T07:16:10Z</dc:date>
    </item>
    <item>
      <title>Re: Hive - Line Termination in Quotes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368752#M240254</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/104648"&gt;@Abdul_&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;It looks like the data contains a newline character (\n) inside a field value, which causes the record to be split into two rows.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Can you modify the data to remove the "\n" from the sample data? If so, the CREATE statement you are using is correct.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;If modifying the data is not possible, you may try "LazySimpleSerDe", although it may not be as performant as the OpenCSVSerde for large datasets.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;Hope this helps,&lt;/P&gt;&lt;P class="p1"&gt;Tarun&lt;/P&gt;</description>
      <pubDate>Tue, 18 Apr 2023 04:33:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368752#M240254</guid>
      <dc:creator>tj2007</dc:creator>
      <dc:date>2023-04-18T04:33:27Z</dc:date>
    </item>
    <item>
      <title>Re: Hive - Line Termination in Quotes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368761#M240258</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/99744"&gt;@tj2007&lt;/a&gt;, It is not possible to modify the data. I have tried the "&lt;SPAN&gt;LazySimpleSerDe&lt;/SPAN&gt;" but it didn't give the correct output (Mentioned below).&lt;/P&gt;&lt;PRE&gt;&lt;SPAN&gt;ROW&lt;/SPAN&gt; &lt;SPAN&gt;FORMAT&lt;/SPAN&gt; &lt;SPAN&gt;SERDE&lt;/SPAN&gt; 
  &lt;SPAN&gt;'org.apache.hadoop.hive.serde2.OpenCSVSerde'&lt;/SPAN&gt;
&lt;SPAN&gt;with&lt;/SPAN&gt; &lt;SPAN&gt;serdeproperties&lt;/SPAN&gt;&lt;SPAN&gt; (
    &lt;/SPAN&gt;&lt;SPAN&gt;"separatorChar"&lt;/SPAN&gt; &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;","&lt;/SPAN&gt;&lt;SPAN&gt;,
    &lt;/SPAN&gt;&lt;SPAN&gt;"quoteChar"&lt;/SPAN&gt;     &lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;'\""'&lt;/SPAN&gt;&lt;SPAN&gt;)      &lt;/SPAN&gt;&lt;SPAN&gt;STORED&lt;/SPAN&gt; &lt;SPAN&gt;AS&lt;/SPAN&gt; &lt;SPAN&gt;TEXTFILE&lt;/SPAN&gt;&lt;/PRE&gt;&lt;TABLE width="353"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="250"&gt;"IM43163","SOUTH,OFC","10-Jan-23"&lt;/TD&gt;&lt;TD width="78"&gt;?&lt;/TD&gt;&lt;TD width="25"&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;"IM41763","John:&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;comment added","12-Jan-23"&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;TD&gt;?&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I need output like this.&lt;/P&gt;&lt;TABLE width="458"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="250"&gt;IM43163&lt;/TD&gt;&lt;TD width="143"&gt;SOUTH,OFC&lt;/TD&gt;&lt;TD width="65"&gt;10-Jan-23&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;IM41763&lt;/TD&gt;&lt;TD&gt;John:comment added&lt;/TD&gt;&lt;TD&gt;12-Jan-23&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;Please also note that the input file is a CSV file that I can open successfully in Excel. Your support will be highly appreciated.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Apr 2023 06:07:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/368761#M240258</guid>
      <dc:creator>Abdul_</dc:creator>
      <dc:date>2023-04-18T06:07:44Z</dc:date>
    </item>
    <item>
      <title>Re: Hive - Line Termination in Quotes</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/372755#M241372</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/104648"&gt;@Abdul_&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Hive currently does not support a row delimiter other than the newline character. See the corresponding Jira for reference:&amp;nbsp;&lt;A href="https://issues.apache.org/jira/browse/HIVE-11996" target="_self"&gt;HIVE-11996&lt;/A&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;As a workaround, I recommend preprocessing the input file with an external tool such as awk so that each record sits on a single line, and then uploading the cleaned file to the corresponding file system location for the table to read.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;For example, using awk:&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[root@c2757-node2 ~]# awk -F "\",\"" 'NF &amp;lt; 3 {getline nextline; $0 = $0 nextline} 1' sample_case.txt
"IM43163","SOUTH,OFC","10-Jan-23"
"IM41763","John:comment added","12-Jan-23"
[root@c2757-node2 ~]# awk -F "\",\"" 'NF &amp;lt; 3 {getline nextline; $0 = $0 nextline} 1' sample_case.txt  &amp;gt; sample_text.csv&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Reading from Hive Table&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;0: jdbc:hive2://c2757-node2.coelab.cloudera.c&amp;gt; select * from table1;
.
.
.
INFO  : Executing command(queryId=hive_20230616064136_333ff98d-636b-43b1-898d-fca66031fe7f): select * from table1
INFO  : Completed executing command(queryId=hive_20230616064136_333ff98d-636b-43b1-898d-fca66031fe7f); Time taken: 0.023 seconds
INFO  : OK
+---------------+---------------------+---------------+
| table1.col_1  |    table1.col_2     | table1.col_3  |
+---------------+---------------------+---------------+
| IM43163       | SOUTH,OFC           | 10-Jan-23     |
| IM41763       | John:comment added  | 12-Jan-23     |
+---------------+---------------------+---------------+
2 rows selected (1.864 seconds)&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Jun 2023 06:47:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Hive-Line-Termination-in-Quotes/m-p/372755#M241372</guid>
      <dc:creator>ggangadharan</dc:creator>
      <dc:date>2023-06-16T06:47:42Z</dc:date>
    </item>
  </channel>
</rss>

