<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question how to split name based on special character in hive in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/how-to-split-name-based-on-special-character-in-hive/m-p/242004#M203807</link>
    <description>&lt;P&gt;I have a table with column name "fullname" which I would like to split into four columns (LAST_NAME, TITLE, FIRST_NAME, MIDDLE_NAME) while loading into another table.&lt;/P&gt;&lt;P&gt; If it is a person's name, the convention is LAST_NAME TITLE! FIRST_NAME MIDDLE_NAME: &lt;/P&gt;&lt;P&gt;
for example I have "abc xxx! def ghi" in my table. This should be split and loaded in to 4 different columns. &lt;/P&gt;&lt;P&gt;the name should be split into LAST, FIRST, MID. Word Preceding the exclamation is TITLE (like xxx here). &lt;/P&gt;&lt;P&gt;If it is an organisation name ("Names of organisations are ended by an exclamation mark) : I should move the entire string to FIRST_NAME. &lt;/P&gt;&lt;P&gt;
for example: abc systems! should be loaded to FIRST_NAME. &lt;/P&gt;&lt;P&gt;current state:&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;fullname&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;abc xxx! def ghi&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;abc systems!&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;expected result:&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Last&lt;/TD&gt;&lt;TD&gt;Title&lt;/TD&gt;&lt;TD&gt;First&lt;/TD&gt;&lt;TD&gt;Middle&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;abc&lt;/TD&gt;&lt;TD&gt;xxx&lt;/TD&gt;&lt;TD&gt;def&lt;/TD&gt;&lt;TD&gt;ghi&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;abc systems&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;Can someone help how to write a query for the above requirement? Thanks in Advance!&lt;/P&gt;</description>
    <pubDate>Fri, 11 Jan 2019 22:36:24 GMT</pubDate>
    <dc:creator>madankumarpuril</dc:creator>
    <dc:date>2019-01-11T22:36:24Z</dc:date>
  </channel>
</rss>

