<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Sqoop running on closed cluster (only edge server has access) in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103516#M46269</link>
    <description>&lt;P&gt;Hi. &lt;A rel="user" href="https://community.cloudera.com/users/13370/mmazerolle.html" nodeid="13370"&gt;@Marc Mazerolle&lt;/A&gt;. sqoop runs by executing the jdbc on in a mapper on the data nodes, not on the edge node. That's what makes it so fast - multiple hosts pulling data. That means the data nodes need to talk to your sql server on 1433. I'm not sure about your port forwarding workaround, but I assume you understand that using netcat (nc) to do port forwarding is not a good solution for anything but proof of concept for multiple reasons. And, with netcat, I think it would only work for a single mapper since netcat can only handle one connection at a time, so if you had multiple mappers, they could't get through netcat. I think you may need to open the port between your data nodes and the sql server.&lt;/P&gt;</description>
    <pubDate>Wed, 16 Nov 2016 02:27:13 GMT</pubDate>
    <dc:creator>james_jones</dc:creator>
    <dc:date>2016-11-16T02:27:13Z</dc:date>
    <item>
      <title>Sqoop running on closed cluster (only edge server has access)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103515#M46268</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I have a small cluster (10 machines now). One edge server that has two network cards (one on the internal network 142.39.41.*, the other sees the cluster at 10.1.1.*), a management server and 8 data nodes all on the 10.1.1.* network.&lt;/P&gt;&lt;P&gt;Sqoop is on the edge server but when I try to import a single table from a sql server database (on 142.39.41.*) &lt;/P&gt;&lt;PRE&gt;sqoop import \
  --connect 'jdbc:sqlserver://dbserver;DatabaseName=MyDB;user=XXXXXXXXX;password=XXXXXXX;port=1433' \
  --table=dbo.Asset \
  --driver com.microsoft.sqlserver.jdbc.SQLServerDriver \
  -m 1
&lt;/PRE&gt;&lt;P&gt;I get :&lt;/P&gt;&lt;PRE&gt;Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: The TCP/IP connection to the host devsql94, port 1433 has failed. Error: "null. Verify the connection properties. Make sure that an instance of SQL Server is running on the host and accepting TCP/IP connections at the port. Make sure that TCP connections to the port are not blocked by a firewall.".
&lt;/PRE&gt;&lt;P&gt;When i try to list the tables in the database using Sqoop &lt;/P&gt;&lt;PRE&gt;sqoop list-tables --connect 'jdbc:sqlserver://dbserver;DatabaseName=MyDB;user=XXXXXXXXX;password=XXXXXXX;port=1433' --driver   com.microsoft.sqlserver.jdbc.SQLServerDriver&lt;/PRE&gt;&lt;P&gt;it works fine and list all my tables so the JDBC access from the edge server to the database works.&lt;/P&gt;&lt;P&gt;Wrapping my head around the problem I started to think that Sqoop might be sending the jobs to another node (?) to handle the database reading. But which node ?&lt;/P&gt;&lt;P&gt;So I tried to port forwarding the edge localport 1433 to the sql server database port 1433 using "nc" as per &lt;A target="_blank" href="https://29a.ch/2009/5/10/forwarding-ports-using-netcat"&gt;this site&lt;/A&gt; but it didn't work either.&lt;/P&gt;&lt;P&gt;Can anyone figure this one out ? Is my architecture setup wrong by allowing only the edge server to see the corporate network ?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Nov 2016 21:07:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103515#M46268</guid>
      <dc:creator>mmazerolle</dc:creator>
      <dc:date>2016-11-15T21:07:21Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop running on closed cluster (only edge server has access)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103516#M46269</link>
      <description>&lt;P&gt;Hi. &lt;A rel="user" href="https://community.cloudera.com/users/13370/mmazerolle.html" nodeid="13370"&gt;@Marc Mazerolle&lt;/A&gt;. sqoop runs by executing the jdbc on in a mapper on the data nodes, not on the edge node. That's what makes it so fast - multiple hosts pulling data. That means the data nodes need to talk to your sql server on 1433. I'm not sure about your port forwarding workaround, but I assume you understand that using netcat (nc) to do port forwarding is not a good solution for anything but proof of concept for multiple reasons. And, with netcat, I think it would only work for a single mapper since netcat can only handle one connection at a time, so if you had multiple mappers, they could't get through netcat. I think you may need to open the port between your data nodes and the sql server.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Nov 2016 02:27:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103516#M46269</guid>
      <dc:creator>james_jones</dc:creator>
      <dc:date>2016-11-16T02:27:13Z</dc:date>
    </item>
    <item>
      <title>Re: Sqoop running on closed cluster (only edge server has access)</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103517#M46270</link>
      <description>&lt;P&gt;Or bridge the edge server.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Nov 2016 02:44:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Sqoop-running-on-closed-cluster-only-edge-server-has-access/m-p/103517#M46270</guid>
      <dc:creator>mmazerolle</dc:creator>
      <dc:date>2016-11-16T02:44:59Z</dc:date>
    </item>
  </channel>
</rss>

