<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Running distcp between two cluster: One Kerberized and the other is not in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/394241#M248681</link>
    <description>&lt;P&gt;RPC communication with the non-secure cluster is blocked, so use the WebHDFS protocol instead: hadoop distcp -D ipc.client.fallback-to-simple-auth-allowed=true webhdfs://nn1:50070/foo/bar hdfs://nn2:8020/bar/foo&lt;/P&gt;</description>
    <pubDate>Mon, 30 Sep 2024 14:41:51 GMT</pubDate>
    <dc:creator>irunker</dc:creator>
    <dc:date>2024-09-30T14:41:51Z</dc:date>
    <item>
      <title>Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94368#M57790</link>
      <description>&lt;P&gt;hadoop distcp -i -log /tmp/ hdfs://xxx:8020/apps/yyyy hdfs://xxx_cid/tmp/&lt;/P&gt;&lt;P&gt;In this case "xxx" is the unsecured cluster, while "xxx_cid" is the secure cluster.&lt;/P&gt;&lt;P&gt;We are launching the job from the Kerberos cluster, with the appropriate kinit for the user, and getting the following error:&lt;/P&gt;&lt;P&gt;java.io.IOException: Failed on local exception: java.io.IOException: Server asks us to fall back to SIMPLE auth, but this client is configured to only allow secure connections.; Host Details : local host is: "xxx/10.x.x.x"; destination host is: "xxx":8020;&lt;/P&gt;&lt;P&gt;...&lt;/P&gt;&lt;P&gt;Caused by: java.io.IOException: Server asks us to fall back to SIMPLE auth, but this client is configured to only allow secure connections.&lt;/P&gt;&lt;P&gt;I thought that launching the job from the secure cluster would avoid any access issues. But it appears that the processes are kicked off from the "source" cluster, which in this case is the insecure cluster.&lt;/P&gt;&lt;P&gt;Any ideas on getting around this?&lt;/P&gt;</description>
      <pubDate>Sat, 26 Sep 2015 00:58:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94368#M57790</guid>
      <dc:creator>dstreev</dc:creator>
      <dc:date>2015-09-26T00:58:11Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94369#M57791</link>
      <description>&lt;P&gt;It sounds like the distcp process is running in secure mode and is configured to reject simple (non-Kerberos) connections.&lt;/P&gt;&lt;P&gt;Try setting this config option:&lt;/P&gt;&lt;PRE&gt;ipc.client.fallback-to-simple-auth-allowed=true&lt;/PRE&gt;</description>
      <pubDate>Sat, 26 Sep 2015 17:28:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94369#M57791</guid>
      <dc:creator>stevel</dc:creator>
      <dc:date>2015-09-26T17:28:37Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94370#M57792</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/175/dstreever.html" nodeid="175"&gt;@dstreever@hortonworks.com&lt;/A&gt; To use DistCp for copying between a secure cluster and an insecure one, add the following to core-site.xml using Ambari.&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;ipc.client.fallback-to-simple-auth-allowed&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;true&amp;lt;/value&amp;gt; 
&amp;lt;/property&amp;gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 29 Sep 2015 17:20:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94370#M57792</guid>
      <dc:creator>pardeep_kumar</dc:creator>
      <dc:date>2015-09-29T17:20:59Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94371#M57793</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/139/pardeepkumar.html" nodeid="139"&gt;@Pardeep&lt;/A&gt; Nice find!  Link&lt;/P&gt;&lt;P&gt;When copying data from a secure cluster to a secure cluster, the following configuration setting is required in the &lt;CODE&gt;core-site.xml&lt;/CODE&gt; file:&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;hadoop.security.auth_to_local&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;&amp;lt;/value&amp;gt;
    &amp;lt;description&amp;gt;Maps kerberos principals to local user names&amp;lt;/description&amp;gt;
&amp;lt;/property&amp;gt; &lt;/PRE&gt;</description>
      <pubDate>Thu, 05 Nov 2015 00:09:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94371#M57793</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2015-11-05T00:09:06Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94372#M57794</link>
      <description>&lt;P&gt;I recommend not setting this in core-site.xml, and instead setting it on the command line invocation specifically for the DistCp command that needs to communicate with the unsecured cluster.  Setting it in core-site.xml means that all RPC connections for any application are eligible for fallback to simple authentication.  This potentially expands the attack surface for man-in-the-middle attacks.&lt;/P&gt;&lt;P&gt;Here is an example of overriding the setting on the command line while running DistCp:&lt;/P&gt;&lt;PRE&gt;hadoop distcp -D ipc.client.fallback-to-simple-auth-allowed=true hdfs://nn1:8020/foo/bar hdfs://nn2:8020/bar/foo&lt;/PRE&gt;&lt;P&gt;The command must be run while logged into the secured cluster, not the unsecured cluster.&lt;/P&gt;</description>
      <pubDate>Thu, 19 Nov 2015 06:25:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94372#M57794</guid>
      <dc:creator>cnauroth</dc:creator>
      <dc:date>2015-11-19T06:25:21Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94373#M57795</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/381/cnauroth.html" nodeid="381"&gt;@Chris Nauroth&lt;/A&gt; Thanks for sharing this. Could you update the answer with more details? I believe this is the best answer if you can add more details. &lt;/P&gt;</description>
      <pubDate>Sat, 21 Nov 2015 22:46:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94373#M57795</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2015-11-21T22:46:29Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94374#M57796</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/140/nsabharwal.html" nodeid="140"&gt;@Neeraj Sabharwal&lt;/A&gt;, thank you.  I updated the answer to show an example of overriding the property from the DistCp command line.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Nov 2015 05:49:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94374#M57796</guid>
      <dc:creator>cnauroth</dc:creator>
      <dc:date>2015-11-24T05:49:42Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94375#M57797</link>
      <description>&lt;P&gt;Thanks &lt;A rel="user" href="https://community.cloudera.com/users/381/cnauroth.html" nodeid="381"&gt;@Chris Nauroth&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 24 Nov 2015 07:37:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94375#M57797</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2015-11-24T07:37:11Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94378#M57800</link>
      <description>&lt;PRE&gt;I get the error below after running the command "hadoop distcp -D ipc.client.fallback-to-simple-auth-allowed=true hdfs://nn1:8020/foo/bar hdfs://nn2:8020/bar/foo":

java.io.EOFException: End of File Exception between local host is *** ; destination host is:***;

Please suggest a fix.&lt;/PRE&gt;</description>
      <pubDate>Wed, 16 Mar 2016 23:44:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94378#M57800</guid>
      <dc:creator>hari_prsd14</dc:creator>
      <dc:date>2016-03-16T23:44:46Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94379#M57801</link>
      <description>&lt;P&gt;I have found that using webhdfs instead of hdfs for the nonsecure host gets around this error. Note that WebHDFS uses the NameNode HTTP port (50070 by default in Hadoop 2.x), not the RPC port 8020, e.g.,&lt;/P&gt;&lt;PRE&gt;hadoop distcp -D ipc.client.fallback-to-simple-auth-allowed=true webhdfs://nn1:50070/foo/bar hdfs://nn2:8020/bar/foo&lt;/PRE&gt;</description>
      <pubDate>Fri, 13 May 2016 21:09:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94379#M57801</guid>
      <dc:creator>kshaffer_tpgs</dc:creator>
      <dc:date>2016-05-13T21:09:52Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94380#M57802</link>
      <description>&lt;P&gt;Can you try this:&lt;/P&gt;&lt;P&gt;hadoop distcp -D
ipc.client.fallback-to-simple-auth-allowed=true webhdfs://&lt;EM&gt;insecureCluster&lt;/EM&gt;
webhdfs://&lt;EM&gt;secureCluster&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 20:12:53 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94380#M57802</guid>
      <dc:creator>saramakr</dc:creator>
      <dc:date>2016-05-25T20:12:53Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94381#M57803</link>
      <description>&lt;P&gt;We had a similar issue.&lt;/P&gt;&lt;P&gt;Most likely you have the following property in /etc/hadoop/conf/hdfs-site.xml:&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
  &amp;lt;name&amp;gt;dfs.namenode.acls.enabled&amp;lt;/name&amp;gt;
  &amp;lt;value&amp;gt;true&amp;lt;/value&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;Remove this property or set it to "false". It should help.&lt;/P&gt;</description>
      <pubDate>Wed, 08 Mar 2017 04:28:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94381#M57803</guid>
      <dc:creator>leonidzav</dc:creator>
      <dc:date>2017-03-08T04:28:48Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94382#M57804</link>
      <description>&lt;P&gt;I am running distcp from an insecure to a secure Hadoop cluster and getting the error "SIMPLE authentication is not enabled". Can anyone suggest a fix?&lt;/P&gt;&lt;P&gt;hdfs@master02:~&amp;gt; hadoop distcp -Dipc.client.fallback-to-simple-auth-allowed=true hdfs://HDP23:8020/test01.txt hdfs://HDP24:8020/&lt;/P&gt;&lt;P&gt;17/04/05 00:09:28 ERROR tools.DistCp: Invalid arguments: org.apache.hadoop.security.AccessControlException: SIMPLE authentication is not enabled. Available:[TOKEN, KERBEROS]&lt;/P&gt;</description>
      <pubDate>Wed, 05 Apr 2017 02:03:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94382#M57804</guid>
      <dc:creator>turing_nix</dc:creator>
      <dc:date>2017-04-05T02:03:10Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94383#M57805</link>
      <description>&lt;P&gt;Adding this property in core-site.xml helped resolve the error.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Mar 2018 06:23:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/94383#M57805</guid>
      <dc:creator>jkiran2020</dc:creator>
      <dc:date>2018-03-28T06:23:18Z</dc:date>
    </item>
    <item>
      <title>Re: Running distcp between two cluster: One Kerberized and the other is not</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/394241#M248681</link>
      <description>&lt;P&gt;RPC communication with the non-secure cluster is blocked, so use the WebHDFS protocol instead: hadoop distcp -D ipc.client.fallback-to-simple-auth-allowed=true webhdfs://nn1:50070/foo/bar hdfs://nn2:8020/bar/foo&lt;/P&gt;</description>
      <pubDate>Mon, 30 Sep 2024 14:41:51 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Running-distcp-between-two-cluster-One-Kerberized-and-the/m-p/394241#M248681</guid>
      <dc:creator>irunker</dc:creator>
      <dc:date>2024-09-30T14:41:51Z</dc:date>
    </item>
  </channel>
</rss>

