<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Best practice for exporting oracle rdbms to hive in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119915#M38840</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2591/adija-1.html" nodeid="2591"&gt;@Adi Jabkowsky&lt;/A&gt;. Take a look at the QueryDatabaseTable processor. &lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.QueryDatabaseTable/index.html" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.QueryDatabaseTable/index.html&lt;/A&gt;. You can also use the ExecuteSQL processor &lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.ExecuteSQL/index.html" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.ExecuteSQL/index.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 26 Aug 2016 01:04:15 GMT</pubDate>
    <dc:creator>SQLShaw</dc:creator>
    <dc:date>2016-08-26T01:04:15Z</dc:date>
    <item>
      <title>Best practice for exporting oracle rdbms to hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119914#M38839</link>
      <description>&lt;P&gt;Hi all&lt;/P&gt;&lt;P&gt;I'm using nifi 1.0 and need to export data from an Oracle rdbms and insert the data into an Orc table on hive. &lt;/P&gt;&lt;P&gt;What is the best way to do that (using what processors) using nothing but NiFi? &lt;/P&gt;&lt;P&gt;To be even more exact if I have the source data as csv then it's not a problem, however It seems that I have to use a custom script for exporting the data to csv from the rdbms. I'd rather use NiFi processors but can't find the suitable to do so. &lt;/P&gt;&lt;P&gt;Any suggestions? &lt;/P&gt;&lt;P&gt;Adi&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 00:54:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119914#M38839</guid>
      <dc:creator>Adija1</dc:creator>
      <dc:date>2016-08-26T00:54:53Z</dc:date>
    </item>
    <item>
      <title>Re: Best practice for exporting oracle rdbms to hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119915#M38840</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/2591/adija-1.html" nodeid="2591"&gt;@Adi Jabkowsky&lt;/A&gt;. Take a look at the QueryDatabaseTable processor. &lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.QueryDatabaseTable/index.html" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.QueryDatabaseTable/index.html&lt;/A&gt;. You can also use the ExecuteSQL processor &lt;A href="https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.ExecuteSQL/index.html" target="_blank"&gt;https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.ExecuteSQL/index.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 01:04:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119915#M38840</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2016-08-26T01:04:15Z</dc:date>
    </item>
    <item>
      <title>Re: Best practice for exporting oracle rdbms to hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119916#M38841</link>
      <description>&lt;P&gt;To add to Scott's answer, you can use QueryDatabaseTable (for a one-time export, if you choose a "maximum value column" like the primary key column) into a ConvertAvroToORC processor (available in the 1.0 GA release), then a PutHDFS processor to get the data into Hadoop. If the table has not been created, ConvertAvroToORC generates partial Hive DDL in an attribute (CREATE TABLE IF NOT EXISTS...), so after PutHDFS you could have a ReplaceText processor to put the DDL statement (along with the file's HDFS location) into the flow file, then send that to a PutHiveQL processor, which would execute the DDL statement, creating the table atop the directory containing your file(s) in HDFS.&lt;/P&gt;&lt;P&gt;That might sound a bit complicated, but it is flexible and powerful. I will post a template to the NiFi wiki after 1.0 is released, showing how such a flow would work.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 01:44:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119916#M38841</guid>
      <dc:creator>mburgess</dc:creator>
      <dc:date>2016-08-26T01:44:22Z</dc:date>
    </item>
    <item>
      <title>Re: Best practice for exporting oracle rdbms to hive</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119917#M38842</link>
      <description>&lt;P&gt;Thank you both so much for the response!
The querying part using the processors you recommended is pretty straightforward - the problem is - what to use after the querying ? How do i transform the data into csv &lt;STRONG&gt;or &lt;/STRONG&gt;other format which i can insert into hadoop ?
There is no ConvertAvroToCSV processor and ConvertAvroToOrc is not available in the 1.0 Beta release. Where can i get the 1.0 GA release ??&lt;/P&gt;</description>
      <pubDate>Sun, 28 Aug 2016 13:23:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Best-practice-for-exporting-oracle-rdbms-to-hive/m-p/119917#M38842</guid>
      <dc:creator>Adija1</dc:creator>
      <dc:date>2016-08-28T13:23:09Z</dc:date>
    </item>
  </channel>
</rss>

