<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Data ingestion from MSSQL server to HDFS? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118978#M81761</link>
    <description>&lt;P&gt;+1 on suggestion #1&lt;/P&gt;</description>
    <pubDate>Thu, 24 Nov 2016 08:17:31 GMT</pubDate>
    <dc:creator>LesterMartin</dc:creator>
    <dc:date>2016-11-24T08:17:31Z</dc:date>
    <item>
      <title>Data ingestion from MSSQL server to HDFS?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118976#M81759</link>
      <description>&lt;P&gt;I want to ingest some data from MSSQL into HDFS periodically for example every week. I only have access to a couple of stored procedures on the Sql server. Is there possibility of calling stored-procedure in Sqoop?&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2016 21:54:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118976#M81759</guid>
      <dc:creator>mashayekhi_m</dc:creator>
      <dc:date>2016-11-23T21:54:19Z</dc:date>
    </item>
    <item>
      <title>Re: Data ingestion from MSSQL server to HDFS?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118977#M81760</link>
      <description>&lt;P&gt;Unfortunately when importing, Sqoop cannot access stored procedures on the source system -- you will have to implement the processing logic on the hadoop side.  To do this, you have three main choices:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;ingest the raw data in a landing zone and use pig to transform (implementing the stored proc logic) into your target hive table.  Note that landing the data raw is a best practice in hadoop ... there is a good chance you may want this raw data for activities elsewhere (like reporting or data science) and storage is cheap.&lt;/LI&gt;&lt;LI&gt;same as above but implement Hive HPL/SQL which is a procedural sql language for hive
&lt;A href="https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=59690156" target="_blank"&gt;https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=59690156&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;same as above but use a 3rd party tool like Syncsort's DMX-h
&lt;A href="https://community.hortonworks.com/answers/list.html?sort=newest&amp;amp;customPageSize=true" target="_blank"&gt;https://community.hortonworks.com/answers/list.html?sort=newest&amp;amp;customPageSize=true&lt;/A&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Notes&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;there are advantages to offloading the stored proc processing to hadoop:
&lt;UL&gt;&lt;LI&gt;it typically takes much less time on hadoop (parallel processing)&lt;/LI&gt;&lt;LI&gt;it frees resources on your source system and thus improves performance on that side&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;when &lt;EM&gt;&lt;STRONG&gt;exporting&lt;/STRONG&gt;&lt;/EM&gt; from hadoop to RDBMS you CAN trigger a stored procedure on the RDBMS side&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;EM&gt;If this is what you are looking for, let me know by accepting the answer; else, please follow up with any remaining gaps.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2016 23:54:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118977#M81760</guid>
      <dc:creator>gkeys</dc:creator>
      <dc:date>2016-11-23T23:54:19Z</dc:date>
    </item>
    <item>
      <title>Re: Data ingestion from MSSQL server to HDFS?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118978#M81761</link>
      <description>&lt;P&gt;+1 on suggestion #1&lt;/P&gt;</description>
      <pubDate>Thu, 24 Nov 2016 08:17:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Data-ingestion-from-MSSQL-server-to-HDFS/m-p/118978#M81761</guid>
      <dc:creator>LesterMartin</dc:creator>
      <dc:date>2016-11-24T08:17:31Z</dc:date>
    </item>
  </channel>
</rss>

